Explanipedia

FourierCompress: Layer-Aware Spectral Activation Compression for Efficient and Accurate Collaborative LLM Inference Open

Jian Ma, Xinchen Lyu, Jun Jiang, Longhao Zou, Chenshan Ren , et al. · 2025

Collaborative large language model (LLM) inference enables real-time, privacy-preserving AI services on resource-constrained edge devices by partitioning computational workloads between client devices and edge servers. However, this paradi…

Objective-Driven Differentiable Optimization of Traffic Prediction and Resource Allocation for Split AI Inference Edge Networks Open

Xinchen Lyu, Y Li, Ying He, Chenshan Ren, Wei Ni , et al. · 2024

Split AI inference partitions an artificial intelligence (AI) model into multiple parts, enabling the offloading of computation-intensive AI services. Resource allocation is critical for the performance of split AI inference. The challenge…

Online-Learning-Based Predictive Optimization of Uplink Scheduling for Industrial Internet-of-Things Open

Chenshan Ren, Xinchen Lyu · 2024

The industrial Internet of Things (IIoT) operates in dynamic environments where wireless channels are subject to rapid changes, posing significant challenges for reliable data transmission. This paper introduces a novel online learning app…

Mobile Edge Computing for Future Internet-of-Things Open

Chenshan Ren · 2020

Chenshan Ren YOU? Author Swipe