Yipeng Shen
KVFlow: Efficient Prefix Caching for Accelerating LLM-Based Multi-Agent Workflows
Large language model (LLM) based agentic workflows have become a popular paradigm for coordinating multiple specialized agents to solve complex tasks. To improve serving efficiency, existing LLM systems employ prefix caching to reuse key-value (KV) cache…
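The mechanism the abstract refers to, reusing KV cache across requests that share a prompt prefix, can be illustrated with a token-level trie lookup. The sketch below is a generic, minimal illustration of prefix caching, not KVFlow's actual design; the names `PrefixCache`, `Node`, and `kv_block` are assumptions made for this example.

```python
# Minimal sketch of prefix caching for LLM serving (illustrative only,
# not KVFlow's implementation). KV-cache handles are stored in a trie
# keyed by token IDs; a new request reuses the longest cached prefix
# and only needs prefill computation for the unmatched suffix.

from __future__ import annotations
from dataclasses import dataclass, field


@dataclass
class Node:
    children: dict[int, Node] = field(default_factory=dict)
    kv_block: object | None = None  # handle to cached KV tensors for this prefix


class PrefixCache:
    def __init__(self) -> None:
        self.root = Node()

    def insert(self, tokens: list[int], kv_block: object) -> None:
        """Register a KV-cache handle under the full token prefix."""
        node = self.root
        for t in tokens:
            node = node.children.setdefault(t, Node())
        node.kv_block = kv_block

    def longest_prefix(self, tokens: list[int]) -> tuple[int, object | None]:
        """Return (matched_length, kv_block) for the longest cached prefix."""
        node, best_len, best_kv = self.root, 0, None
        for i, t in enumerate(tokens):
            if t not in node.children:
                break
            node = node.children[t]
            if node.kv_block is not None:
                best_len, best_kv = i + 1, node.kv_block
        return best_len, best_kv
```

Serving engines with prefix caching (for example, SGLang's radix cache or vLLM's automatic prefix caching) use a comparable longest-prefix match so the scheduler skips prefill for the matched tokens; in a multi-agent workflow, agents that share a long system prompt benefit repeatedly from the same cached prefix.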