Yufei Ding
YOU?
Author Swipe
View article: Orders in Chaos: Enhancing Large-Scale MoE LLM Serving with Data Movement Forecasting
Orders in Chaos: Enhancing Large-Scale MoE LLM Serving with Data Movement Forecasting Open
Large-scale Mixture of Experts (MoE) Large Language Models (LLMs) have recently become the frontier open weight models, achieving remarkable model capability similar to proprietary ones. But their random expert selection mechanism introduc…
View article: Mercury: Unlocking Multi-GPU Operator Optimization for LLMs via Remote Memory Scheduling
Mercury: Unlocking Multi-GPU Operator Optimization for LLMs via Remote Memory Scheduling Open
View article: Efficacy and safety of cold snare polypectomy for outpatient treatment of sessile polyps smaller than 10mm
Efficacy and safety of cold snare polypectomy for outpatient treatment of sessile polyps smaller than 10mm Open
View article: PowerMove: Optimizing Compilation for Neutral Atom Quantum Computers with Zoned Architecture
PowerMove: Optimizing Compilation for Neutral Atom Quantum Computers with Zoned Architecture Open
View article: Quantitative computed tomography analysis of bone microarchitecture is associated with rotator cuff healing
Quantitative computed tomography analysis of bone microarchitecture is associated with rotator cuff healing Open
Tendon-bone healing is impaired in the presence of OP but can be partially restored by ALN treatment. Furthermore, CT-based quantitative analysis of bone microarchitecture at the humeral greater tuberosity shows a significant correlation w…
View article: Anti‐Osteoporosis Treatment Alleviates Osteoarthritis Symptoms and Partially Reverses Disease Progression
Anti‐Osteoporosis Treatment Alleviates Osteoarthritis Symptoms and Partially Reverses Disease Progression Open
Objective Osteoarthritis (OA) and osteoporosis (OP) are highly prevalent in postmenopausal women; however, their relationship remains complex and controversial. This study aimed to investigate whether anti‐OP treatment alleviates osteoarth…
View article: HedraRAG: Coordinating LLM Generation and Database Retrieval in Heterogeneous RAG Serving
HedraRAG: Coordinating LLM Generation and Database Retrieval in Heterogeneous RAG Serving Open
This paper addresses emerging system-level challenges in heterogeneous retrieval-augmented generation (RAG) serving, where complex multi-stage workflows and diverse request patterns complicate efficient execution. We present HedraRAG, a ru…
View article: RoboVerse: A Unified Platform, Benchmark and Dataset for Scalable and Generalizable Robot Learning
RoboVerse: A Unified Platform, Benchmark and Dataset for Scalable and Generalizable Robot Learning Open
View article: Hardware-aware Calibration Protocol for Quantum Computers
Hardware-aware Calibration Protocol for Quantum Computers Open
View article: TRACI: Network Acceleration of Input-Dynamic Communication for Large-Scale Deep Learning Recommendation Model
TRACI: Network Acceleration of Input-Dynamic Communication for Large-Scale Deep Learning Recommendation Model Open
View article: SwitchQNet: Optimizing Distributed Quantum Computing for Quantum Data Centers with Switch Networks
SwitchQNet: Optimizing Distributed Quantum Computing for Quantum Data Centers with Switch Networks Open
View article: Construction of cartilaginous organoids based on cartilage extracellular matrix microcarriers to promote articular cartilage regeneration through immune regulation
Construction of cartilaginous organoids based on cartilage extracellular matrix microcarriers to promote articular cartilage regeneration through immune regulation Open
View article: KPerfIR: Towards an Open and Compiler-centric Ecosystem for GPU Kernel Performance Tooling on Modern AI Workloads
KPerfIR: Towards an Open and Compiler-centric Ecosystem for GPU Kernel Performance Tooling on Modern AI Workloads Open
In this work, we propose KPerfIR, a novel multilevel compiler-centric infrastructure to enable the development of customizable, extendable, and portable profiling tools tailored for modern artificial intelligence (AI) workloads on modern G…
View article: Conditional Generative Modeling for Amorphous Multi-Element Materials
Conditional Generative Modeling for Amorphous Multi-Element Materials Open
Amorphous multi-element materials offer unprecedented tunability in composition and properties, yet their rational design remains challenging due to the lack of predictive structure-property relationships and vast configurational space. Tr…
View article: OneAdapt: Adaptive Compilation for Resource-Constrained Photonic One-Way Quantum Computing
OneAdapt: Adaptive Compilation for Resource-Constrained Photonic One-Way Quantum Computing Open
Measurement-based quantum computing (MBQC), a.k.a. one-way quantum computing (1WQC), is a universal quantum computing model, which is particularly well-suited for photonic platforms. In this model, computation is driven by measurements on …
View article: Flexion: Adaptive In-Situ Encoding for On-Demand QEC in Ion Trap Systems
Flexion: Adaptive In-Situ Encoding for On-Demand QEC in Ion Trap Systems Open
Recent advances in quantum hardware and quantum error correction (QEC) have set the stage for early demonstrations of fault-tolerant quantum computing (FTQC). A key near-term goal is to build a system capable of executing millions of logic…
View article: S-QGPU: Shared quantum gate processing unit for distributed quantum computing
S-QGPU: Shared quantum gate processing unit for distributed quantum computing Open
We propose a distributed quantum computing (DQC) architecture in which individual small-sized quantum computers are connected to a shared quantum gate processing unit (S-QGPU). The S-QGPU comprises a collection of hybrid two-qubit gate mod…
View article: STQS: A Unified System Architecture for Spatial Temporal Quantum Sensing
STQS: A Unified System Architecture for Spatial Temporal Quantum Sensing Open
Quantum sensing (QS) harnesses quantum phenomena to measure physical observables with extraordinary precision, sensitivity, and resolution. Despite significant advancements in quantum sensing, prevailing efforts have focused predominantly …
View article: SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation
SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation Open
While spatial reasoning has made progress in object localization relationships, it often overlooks object orientation-a key factor in 6-DoF fine-grained manipulation. Traditional pose representations rely on pre-defined frames or templates…
View article: QECC-Synth: A Layout Synthesizer for Quantum Error Correction Codes on Sparse Architectures
QECC-Synth: A Layout Synthesizer for Quantum Error Correction Codes on Sparse Architectures Open
View article: Optimizing FTQC Programs through QEC Transpiler and Architecture Codesign
Optimizing FTQC Programs through QEC Transpiler and Architecture Codesign Open
Fault-tolerant quantum computing (FTQC) is essential for executing reliable quantum computations of meaningful scale. Widely adopted QEC codes for FTQC, such as the surface code and color codes, utilize Clifford+T gate sets, where T gates …
View article: Optimizing Quantum Communication for Quantum Data Centers with Reconfigurable Networks
Optimizing Quantum Communication for Quantum Data Centers with Reconfigurable Networks Open
Distributed Quantum Computing (DQC) enables scalability by interconnecting multiple QPUs. Among various DQC implementations, quantum data centers (QDCs), which utilize reconfigurable optical switch networks to link QPUs across different ra…
View article: SymBreak: Mitigating Quantum Degeneracy Issues in QLDPC Code Decoders by Breaking Symmetry
SymBreak: Mitigating Quantum Degeneracy Issues in QLDPC Code Decoders by Breaking Symmetry Open
Quantum error correction (QEC) is critical for scalable and reliable quantum computing, but existing solutions, such as surface codes, incur significant qubit overhead. Quantum low-density parity check (qLDPC) codes have recently emerged a…
View article: CaliScalpel: In-Situ and Fine-Grained Qubit Calibration Integrated with Surface Code Quantum Error Correction
CaliScalpel: In-Situ and Fine-Grained Qubit Calibration Integrated with Surface Code Quantum Error Correction Open
Quantum Error Correction (QEC) is a cornerstone of fault-tolerant, large-scale quantum computing. However, qubit error drift significantly degrades QEC performance over time, necessitating periodic calibration. Traditional calibration meth…
View article: PowerMove: Optimizing Compilation for Neutral Atom Quantum Computers with Zoned Architecture
PowerMove: Optimizing Compilation for Neutral Atom Quantum Computers with Zoned Architecture Open
Neutral atom-based quantum computers (NAQCs) have recently emerged as promising candidates for scalable quantum computing, largely due to their advanced hardware capabilities, particularly qubit movement and the zoned architecture (ZA). Ho…
View article: Architectures for Heterogeneous Quantum Error Correction Codes
Architectures for Heterogeneous Quantum Error Correction Codes Open
Quantum Error Correction (QEC) is essential for future quantum computers due to its ability to exponentially suppress physical errors. The surface code is a leading error-correcting code candidate because of its local topological structure…
View article: Corrigendum to: An Essential Role of c-Fos in Notch1-mediated Promotion of Proliferation of KSHV-Infected SH-SY5Y Cells
Corrigendum to: An Essential Role of c-Fos in Notch1-mediated Promotion of Proliferation of KSHV-Infected SH-SY5Y Cells Open
In the online version of the article, a change was made in the author's position. The affiliation of Dongmei Li and Jinli Zhang in the online version of the article titled “An Essential Role of c-Fos in Notch1-mediated Promotion of Prolife…
View article: DexGraspNet 2.0: Learning Generative Dexterous Grasping in Large-scale Synthetic Cluttered Scenes
DexGraspNet 2.0: Learning Generative Dexterous Grasping in Large-scale Synthetic Cluttered Scenes Open
Grasping in cluttered scenes remains highly challenging for dexterous hands due to the scarcity of data. To address this problem, we present a large-scale synthetic benchmark, encompassing 1319 objects, 8270 scenes, and 427 million grasps.…
View article: Improving GPU Multi-Tenancy Through Dynamic Multi-Instance GPU Reconfiguration
Improving GPU Multi-Tenancy Through Dynamic Multi-Instance GPU Reconfiguration Open
Continuous learning (CL) has emerged as one of the most popular deep learning paradigms deployed in modern cloud GPUs. Specifically, CL has the capability to continuously update the model parameters (through model retraining) and use the u…
View article: Large-scale self-normalizing neural networks
Large-scale self-normalizing neural networks Open
Self-normalizing neural networks (SNN) regulate the activation and gradient flows through activation functions with the self-normalization property. As SNNs do not rely on norms computed from minibatches, they are more friendly to data par…