Explanipedia

PSR: Scaling Multi-Subject Personalized Image Generation with Pairwise Subject-Consistency Rewards Open

Shulei Wang, Xin He, Qi Tian · 2025

Personalized generation models for a single subject have demonstrated remarkable effectiveness, highlighting their significant potential. However, when extended to multiple subjects, existing models often exhibit degraded performance, part…

Observations on the Efficacy and Safety of FRED™ Jr FD for the Treatment of Intracranial Unruptured Distal Aneurysms: A Single-Center Experience Open

Qi Tian, Seisyou Kou, Shuailong Shi, Shuhai Long, Yinglong Hou, et al. · 2025

OKM = O'Kelly-Marotta grading scale.

Fog computing based cost optimization for university governance Open

Qi Tian, Wenbin Li · 2025

This study presented a new architecture based on fog computing to effectively reduce the burdensome cost of university governance. The process is established by enhancing network performance and optimizing resource utilization. The solutio…

Snap-Snap: Taking Two Images to Reconstruct 3D Human Gaussians in Milliseconds Open

Jia Lü, Taoran Yi, Jiemin Fang, Yang Chen, C. Wu, et al. · 2025

Reconstructing 3D human bodies from sparse views has been an appealing topic, which is crucial to broadening the related applications. In this paper, we propose a quite challenging but valuable task to reconstruct the human body from only two…

Bone marrow adipocytes: key players in vascular niches, aging, and disease Open

Yonggang Fan, Mai Kamal Abd El-Khalek, Yuheng Zhang, Lu Liu, Qi Tian, et al. · 2025

Bone marrow adipocytes (BMAs) are emerging as metabolically active endocrine organs within the bone marrow microenvironment, engaging in extensive crosstalk with vascular niches, osteogenic cells, and hematopoietic compartments. In aging a…

Efficient Multi-modal Long Context Learning for Training-free Adaptation Open

Zehong Ma, Shiliang Zhang, Longhui Wei, Qi Tian · 2025

Traditional approaches to adapting multi-modal large language models (MLLMs) to new tasks have relied heavily on fine-tuning. This paper introduces Efficient Multi-Modal Long Context Learning (EMLoC), a novel training-free alternative that…

Infinite-Canvas: Higher-Resolution Video Outpainting with Extensive Content Generation Open

Qihua Chen, Yue Ma, Hongfa Wang, Junkun Yuan, Wenzhe Zhao, et al. · 2025

This paper explores higher-resolution video outpainting with extensive content generation. We point out common issues faced by existing methods when attempting to largely outpaint videos: the generation of low-quality content and limitatio…

GraphATC: advancing multilevel and multi-label anatomical therapeutic chemical classification via atom-level graph learning Open

Wengyu Zhang, Qi Tian, Yi Cao, Wenqi Fan, Dongmei Jiang, et al. · 2025

The accurate categorization of compounds within the anatomical therapeutic chemical (ATC) system is fundamental for drug development and fundamental research. Although this area has garnered significant research focus for over a decade, th…

Biomechanical insights and optimization in the teaching design of badminton games based on motion capture and adaptive virtual reality video coding Open

Qi Tian, Jiping Tang · 2025

The application of games in sports not only brings new development to sports, but also brings new requirements to sports. To maintain and enhance students’ learning motivation and interest, more effective individualized teaching is needed.…

CDG: A semantic–structural controllable framework for crack image generation in complex scenes Open

Tian Qin, Lingxi Xie, Qin Zou, Qi Tian, Qingquan Li · 2025

LiftImage3D: Lifting Any Single Image to 3D Gaussians with Video Generation Priors Open

Yabo Chen, Yang Chen, Jiemin Fang, Xiaopeng Zhang, Lingxi Xie, et al. · 2024

Single-image 3D reconstruction remains a fundamental challenge in computer vision due to inherent geometric ambiguities and limited viewpoint information. Recent advances in Latent Video Diffusion Models (LVDMs) offer promising 3D priors l…

Advances in Plant GABA Research: Biological Functions, Synthesis Mechanisms and Regulatory Pathways Open

Yixuan Hu, Xin Huang, Qin Xiao, Xuan Wu, Qi Tian, et al. · 2024

The γ-aminobutyric acid (GABA) is a widely distributed neurotransmitter in living organisms, known for its inhibitory role in animals. GABA exerts calming effects on the mind, lowers blood pressure in animals, and enhances stress resistanc…

Frailty increases depression risk independently of cognitive decline: Insights from Mendelian randomization and cross-sectional analysis Open

Wenjie Li, Qi Tian, Jingxi Duan, Xintong Liu, Jianwei Shou, et al. · 2024

We provide evidence that frailty could increase depression risk independently of cognitive decline. Further research with a larger sample size is necessary.

Follow-Your-Canvas: Higher-Resolution Video Outpainting with Extensive Content Generation Open

Qihua Chen, Yue Ma, Hongfa Wang, Junkun Yuan, Wenzhe Zhao, et al. · 2024

This paper explores higher-resolution video outpainting with extensive content generation. We point out common issues faced by existing methods when attempting to largely outpaint videos: the generation of low-quality content and limitatio…

Predicting the sea lamprey population dynamic and ecosystem stability based on sex ratio corrected logistic model Open

Qihao Ren, Qi Tian, Xinghan Wang · 2024

The sex ratio of the sea lamprey (Petromyzon marinus) is influenced by population density. A sex ratio corrected population model and an ecosystem stability model were established based on the logistic population model. In these models, the …

Segment Any 4D Gaussians Open

Shengxiang Ji, Guanjun Wu, Jiemin Fang, Jiazhong Cen, Taoran Yi, et al. · 2024

Modeling, understanding, and reconstructing the real world are crucial in XR/VR. Recently, 3D Gaussian Splatting (3D-GS) methods have shown remarkable success in modeling and understanding 3D scenes. Similarly, various 4D representations h…

SDPT: Semantic-Aware Dimension-Pooling Transformer for Image Segmentation Open

Hu Cao, Guang Chen, Hengshuang Zhao, Dongsheng Jiang, Xiaopeng Zhang, et al. · 2024

Image segmentation plays a critical role in autonomous driving by providing vehicles with a detailed and accurate understanding of their surroundings. Transformers have recently shown encouraging results in image segmentation. However, tra…

A Survey of Generative Techniques for Spatial-Temporal Data Mining Open

Qianru Zhang, Haixin Wang, Long Cheng, Liangcai Su, Xingwei He, et al. · 2024

This paper focuses on the integration of generative techniques into spatial-temporal data mining, considering the significant growth and diverse nature of spatial-temporal data. With the advancements in RNNs, CNNs, and other non-generative…

Predicting dust pollution from dry bulk ports in coastal cities: A hybrid approach based on data decomposition and deep learning Open

Wenyuan Wang, Bochi Liu, Qi Tian, Xinglu Xu, Yun Peng, et al. · 2024

Dust pollution from storage and handling of materials in dry bulk ports seriously affects air quality and public health in coastal cities. Accurate prediction of dust pollution helps identify risks early and take preventive measures. Howev…

Visual Tuning Open

Bruce X. B. Yu, Jianlong Chang, Haixin Wang, Lingbo Liu, Shijie Wang, et al. · 2024

Fine-tuning visual models has widely shown promising performance on many downstream visual tasks. With the surprising development of pre-trained visual foundation models, visual tuning jumped out of the standard modus operandi that fi…

AlignZeg: Mitigating Objective Misalignment for Zero-shot Semantic Segmentation Open

Jiannan Ge, Lingxi Xie, Hongtao Xie, Pandeng Li, Xiaopeng Zhang, et al. · 2024

A serious issue that harms the performance of zero-shot visual recognition is named objective misalignment, i.e., the learning objective prioritizes improving the recognition accuracy of seen classes rather than unseen classes, while the l…

BLADE: Enhancing Black-box Large Language Models with Small Domain-Specific Models Open

Haitao Li, Qingyao Ai, Jia Chen, Qian Dong, Zhijing Wu, et al. · 2024

Large Language Models (LLMs) like ChatGPT and GPT-4 are versatile and capable of addressing a diverse range of tasks. However, general LLMs, which are developed on open-domain data, may lack the domain-specific knowledge essential for task…

LION: Implicit Vision Prompt Tuning Open

Haixin Wang, Jianlong Chang, Yihang Zhai, Xiao Luo, Jinan Sun, et al. · 2024

Despite recent promising performances across a range of vision tasks, vision Transformers still have an issue of high computational costs. Recently, vision prompt learning has provided an economical solution to this problem without fine-tu…

Influence of distributed generation on fault characteristics and relay protection of rural distribution network Open

Bo Liu, Weichen Liang, Yajuan Wang, Zhiyu Zhao, Qi Tian, et al. · 2024

A high proportion of distributed generation affects the traditional rural distribution network. In the paper, a typical line model of a rural distribution network with a high proportion of distributed generation was constructed, and the current and voltage…

Deep Watermarking for Deep Intellectual Property Protection: A Comprehensive Survey Open

Yuchen Sun, Li Liu, Nenghai Yu, Yongxiang Liu, Qi Tian, et al. · 2024

When Parameter-efficient Tuning Meets General-purpose Vision-language Models Open

Yihang Zhai, Haixin Wang, Jianlong Chang, Xinlong Yang, Jinan Sun, et al. · 2023

Instruction tuning has shown promising potential for developing general-purpose AI capabilities by using large-scale pre-trained models and boosts growing research to integrate multimodal information for creative applications. However, exi…

Parameter Efficient Fine-tuning via Cross Block Orchestration for Segment Anything Model Open

Zelin Peng, Zhengqin Xu, Zhilin Zeng, Lingxi Xie, Qi Tian, et al. · 2023

Parameter-efficient fine-tuning (PEFT) is an effective methodology to unleash the potential of large foundation models in novel scenarios with limited training data. In the computer vision community, PEFT has shown effectiveness in image c…

One-Bit Supervision for Image Classification: Problem, Solution, and Beyond Open

Hengtong Hu, Lingxi Xie, Xinyue Huo, Richang Hong, Qi Tian · 2023

This article presents one-bit supervision, a novel setting of learning with fewer labels, for image classification. Instead of the training model using the accurate label of each sample, our setting requires the model to interact with the …

Probabilistic Tree-of-thought Reasoning for Answering Knowledge-intensive Complex Questions Open

Shulin Cao, Jiajie Zhang, Jiaxin Shi, Xin Lv, Zijun Yao, et al. · 2023

Large language models (LLMs) are capable of answering knowledge-intensive complex questions with chain-of-thought (CoT) reasoning. However, they tend to generate factually incorrect reasoning steps when the required knowledge is not availa…

AiluRus: A Scalable ViT Framework for Dense Prediction Open

Jin Li, Yaoming Wang, Xiaopeng Zhang, Bowen Shi, Dongsheng Jiang, et al. · 2023

Vision transformers (ViTs) have emerged as a prevalent architecture for vision tasks owing to their impressive performance. However, when it comes to handling long token sequences, especially in dense prediction tasks that require high-res…
