Explanipedia

Confidential LLM Inference: Performance and Cost Across CPU and GPU TEEs Open

Marcin Chrapek, Marcin Copik, Etienne Mettaz, Torsten Hoefler · 2025

Large Language Models (LLMs) are increasingly deployed on converged Cloud and High-Performance Computing (HPC) infrastructure. However, as LLMs handle confidential inputs and are fine-tuned on costly, proprietary datasets, their heightened…

SDR-RDMA: Software-Defined Reliability Architecture for Planetary Scale RDMA Communication Open

Mikhail Khalilov, Siyuan Shen, Marcin Chrapek, Tiancheng Chen, Kenji Nakano , et al. · 2025

RDMA is vital for efficient distributed training across datacenters, but millisecond-scale latencies complicate the design of its reliability layer. We show that depending on long-haul link characteristics, such as drop rate, distance and …

Fortify Your Foundations: Practical Privacy and Security for Foundation Model Deployments In The Cloud Open

Marcin Chrapek, Anjo Vahldiek-Oberwagner, Marcin Spoczynski, Scott Constable, Mona Vij , et al. · 2024

Computer science Political science

Foundation Models (FMs) display exceptional performance in tasks such as natural language processing and are being applied across a growing range of disciplines. Although typically trained on large public datasets, FMs are often fine-tuned…

Network-Offloaded Bandwidth-Optimal Broadcast and Allgather for Distributed AI Open

Mikhail Khalilov, Salvatore Di Girolamo, Marcin Chrapek, Rami Nudelman, Gil Bloch , et al. · 2024

Computer science

In the Fully Sharded Data Parallel (FSDP) training pipeline, collective operations can be interleaved to maximize the communication/computation overlap. In this scenario, outstanding operations such as Allgather and Reduce-Scatter can comp…

Multi-Head RAG: Solving Multi-Aspect Problems with LLMs Open

Maciej Besta, Ales Kubicek, Roman Niggli, Robert Gerstenberger, Lucas Weitzendorf , et al. · 2024

Computer science Biology

Retrieval Augmented Generation (RAG) enhances the abilities of Large Language Models (LLMs) by enabling the retrieval of documents into the LLM context to provide more accurate and relevant responses. Existing RAG solutions do not focus on…

LLAMP: Assessing Network Latency Tolerance of HPC Applications with Linear Programming Open

Siyuan Shen, Langwen Huang, Marcin Chrapek, Timo Schneider, Jai Dayal , et al. · 2024

Computer science

The shift towards high-bandwidth networks driven by AI workloads in data centers and HPC clusters has unintentionally aggravated network latency, adversely affecting the performance of communication-intensive HPC applications. As large-sca…

Software Resource Disaggregation for HPC with Serverless Computing Open

Marcin Copik, Marcin Chrapek, Larissa Schmid, Alexandru Calotoiu, Torsten Hoefler · 2024

Computer science Economics

Aggregated HPC resources have rigid allocation systems and programming models which struggle to adapt to diverse and changing workloads. Consequently, HPC systems fail to efficiently use the large pools of unused memory and increase the ut…

OSMOSIS: Enabling Multi-Tenancy in Datacenter SmartNICs Open

Mikhail Khalilov, Marcin Chrapek, Siyuan Shen, Alessandro Vezzu, Thomas Benz , et al. · 2023

Computer science

Multi-tenancy is essential for unleashing SmartNIC's potential in datacenters. Our systematic analysis in this work shows that existing on-path SmartNICs have resource multiplexing limitations. For example, existing solutions lack multi-te…

The saphenous vein harvest procedure affects the arteriovenous system and postoperative wound healing in patients following coronary aortic bypass surgery Open

Karol Froń, Marcin Chrapek, Witold Bratkowski, Oldi Ruci, Jerzy Pacholewicz · 2023

Medicine

ENWEndNote BIBJabRef, Mendeley RISPapers, Reference Manager, RefWorks, Zotero AMA Froń K, Chrapek M, Bratkowski W, Ruci O, Pacholewicz J. The saphenous vein harvest procedure affects the arteriovenous system and postoperative wound healing…

Marcin Chrapek YOU? Author Swipe