XaaS Containers: Performance-Portable Representation With Source and IR Containers Open

Marcin Copik, Eiman Alnuaimi, Alok Kamatar, Valérie Hayot‐Sasson, Alberto Madonna, et al. · 2025

High-performance computing (HPC) systems and cloud data centers are converging, and containers are becoming the default method of portable software deployment. Yet, while containers simplify software management, they face significant perfo…

Confidential LLM Inference: Performance and Cost Across CPU and GPU TEEs Open

Marcin Chrapek, Marcin Copik, Etienne Mettaz, Torsten Hoefler · 2025

Large Language Models (LLMs) are increasingly deployed on converged Cloud and High-Performance Computing (HPC) infrastructure. However, as LLMs handle confidential inputs and are fine-tuned on costly, proprietary datasets, their heightened…

AI Factories: It's time to rethink the Cloud-HPC divide Open

Pedro García-López, Daniel Barcelona-Pons, Marcin Copik, Torsten Hoefler, Eduardo Quiñones, et al. · 2025

The strategic importance of artificial intelligence is driving a global push toward Sovereign AI initiatives. National governments are increasingly developing dedicated infrastructures, called AI Factories (AIF), to achieve technological…

DaCe AD: Unifying High-Performance Automatic Differentiation for Machine Learning and Scientific Computing Open

Abdelghani Boudaoud, Alexandru Calotoiu, Marcin Copik, Torsten Hoefler · 2025

Automatic differentiation (AD) is a set of techniques that systematically applies the chain rule to compute the gradients of functions without requiring human intervention. Although the fundamentals of this technology were established deca…

Cppless: Single-Source and High-Performance Serverless Programming in C++ Open

Marcin Copik, Lukas Möller, Alexandru Calotoiu, Torsten Hoefler · 2025

Computer science

The rise of serverless computing introduced a new class of scalable, elastic, and widely available parallel workers in the cloud. Many systems and applications benefit from offloading computations and parallel tasks to dynamically allocate…

Higher-Order Graph Databases Open

Maciej Besta, Sneha Prabha Chandran, Jakub Cudak, Patrick Iff, Marcin Copik, et al. · 2025

Recent advances in graph databases (GDBs) have been driving interest in large-scale analytics, yet current systems fail to support higher-order (HO) interactions beyond first-order (one-hop) relations, which are crucial for tasks such as s…

Affordable AI Assistants with Knowledge Graph of Thoughts Open

Maciej Besta, Lorenzo Paleari, Junyue Jiang, Robert Gerstenberger, You Wu, et al. · 2025

Computer science Business

Large Language Models (LLMs) are revolutionizing the development of AI assistants capable of performing diverse tasks across domains. However, current state-of-the-art LLM-driven agents face significant challenges, including high operation…

Reasoning Language Models: A Blueprint Open

Maciej Besta, Jessica Barth, Eric Schreiber, Ales Kubicek, Afonso Claudino Catarino, et al. · 2025

Computer science Psychology Engineering

Reasoning language models (RLMs), also known as Large Reasoning Models (LRMs), such as OpenAI's o1 and o3, DeepSeek-R1, and Alibaba's QwQ, have redefined AI's problem-solving capabilities by extending LLMs with advanced reasoning mechanism…

Core Hours and Carbon Credits: Incentivizing Sustainability in HPC Open

Alok Kamatar, Maxime Gonthier, Valérie Hayot‐Sasson, André Bauer, Marcin Copik, et al. · 2025

Business Economics Computer science

Realizing a shared responsibility between providers and consumers is critical to manage the sustainability of HPC. However, while cost may motivate efficiency improvements by infrastructure operators, broader progress is impeded by a lack …

A Priori Loop Nest Normalization: Automatic Loop Scheduling in Complex Applications Open

Lukas Trümper, Philipp Schaad, Berke Ates, Alexandru Calotoiu, Marcin Copik, et al. · 2024

Computer science Mathematics Sociology

The same computations are often expressed differently across software projects and programming languages. In particular, how computations involving loops are expressed varies due to the many possibilities to permute and compose loops. Sinc…

SeBS-Flow: Benchmarking Serverless Cloud Function Workflows Open

Larissa Schmid, Marcin Copik, Alexandru Calotoiu, Laurin Brandner, Anne Koziolek, et al. · 2024

Computer science Business Physics

Serverless computing has emerged as a prominent paradigm, with a significant adoption rate among cloud customers. While this model offers advantages such as abstraction from the deployment and resource scheduling, it also poses limitations…

XaaS: Acceleration as a Service to Enable Productive High-Performance Cloud Computing Open

Torsten Hoefler, Marcin Copik, Pete Beckman, Andrew Jones, Ian Foster, et al. · 2024

Computer science Business Physics

High-performance computing (HPC) and the cloud have evolved independently, specializing their innovations into performance or productivity. Acceleration as a Service (XaaS) is a recipe to empower both fields with a shared execution platfor…

Demystifying Chains, Trees, and Graphs of Thoughts Open

Maciej Besta, Florim Memedi, Zhenyu Zhang, Robert Gerstenberger, Nils Blach, et al. · 2024

Computer science Engineering Psychology

The field of natural language processing (NLP) has witnessed significant progress in recent years, with a notable focus on improving large language models' (LLM) performance through innovative prompting techniques. Among these, prompt engi…

Cppless: Single-Source and High-Performance Serverless Programming in C++ Open

Lukas Möller, Marcin Copik, Alexandru Calotoiu, Torsten Hoefler · 2024

Computer science

The rise of serverless computing introduced a new class of scalable, elastic and widely available parallel workers in the cloud. Many systems and applications benefit from offloading computations and parallel tasks to dynamically allocated…

Software Resource Disaggregation for HPC with Serverless Computing Open

Marcin Copik, Marcin Chrapek, Larissa Schmid, Alexandru Calotoiu, Torsten Hoefler · 2024

Computer science Economics

Aggregated HPC resources have rigid allocation systems and programming models which struggle to adapt to diverse and changing workloads. Consequently, HPC systems fail to efficiently use the large pools of unused memory and increase the ut…

XaaS: Acceleration as a Service to Enable Productive High-Performance Cloud Computing Open

Torsten Hoefler, Marcin Copik, Pete Beckman, Andrew Jones, Ian Foster, et al. · 2024

Computer science Mathematics Art

HPC and Cloud have evolved independently, specializing their innovations into performance or productivity. Acceleration as a Service (XaaS) is a recipe to empower both fields with a shared execution platform that provides transparent acces…

User-guided Page Merging for Memory Deduplication in Serverless Systems Open

Wei Qiu, Marcin Copik, Yun Wang, Alexandru Calotoiu, Torsten Hoefler · 2023

Computer science

Serverless computing is an emerging cloud paradigm that offers an elastic and scalable allocation of computing resources with pay-as-you-go billing. In the Function-as-a-Service (FaaS) programming model, applications comprise short-lived a…

FMI: Fast and Cheap Message Passing for Serverless Functions Open

Marcin Copik, Roman Böhringer, Alexandru Calotoiu, Torsten Hoefler · 2023

Computer science Physics Biology

Serverless functions provide elastic scaling and a fine-grained billing model, making Function-as-a-Service (FaaS) an attractive programming model. However, for distributed jobs that benefit from large-scale and dynamic parallelism, the la…

MOM: Matrix Operations in MLIR Open

Lorenzo Chelini, Henrik Barthels, Paolo Bientinesi, Marcin Copik, Tobias Grosser, et al. · 2022

Computer science Mathematics Materials science

Modern research in code generators for dense linear algebra computations has shown the ability to produce optimized code with performance that is comparable to, and often exceeds, that of state-of-the-art implementations by domain experts. How…

FaaSKeeper: Learning from Building Serverless Services with ZooKeeper as an Example Open

Marcin Copik, Alexandru Calotoiu, Konstantin Taranov, Torsten Hoefler · 2022

Computer science Business Philosophy

FaaS (Function-as-a-Service) revolutionized cloud computing by replacing persistent virtual machines with dynamically allocated resources. This shift trades locality and statefulness for a pay-as-you-go model more suited to variable and in…

SISA: Set-Centric Instruction Set Architecture for Graph Mining on Processing-in-Memory Systems Open

Maciej Besta, Raghavendra Kanakagiri, Grzegorz Kwaśniewski, Rachata Ausavarungnirun, Jakub Beránek, et al. · 2021

Computer science

Simple graph algorithms such as PageRank have been the target of numerous hardware accelerators. Yet, there also exist much more complex graph mining algorithms for problems such as clustering or maximal clique listing. These algorithms ar…

SeBS: A Serverless Benchmark Suite for Function-as-a-Service Computing Open

Marcin Copik, Grzegorz Kwaśniewski, Maciej Besta, Michał Podstawski, Torsten Hoefler · 2021

Computer science Geography Business

This upload contains the software prototype, data, analysis scripts, and replication scripts for the paper "SeBS: A Serverless Benchmark Suite for Function-as-a-Service Computing" (ACM/IFIP Middleware 2021). With our artifact we provide th…

Work-Stealing Prefix Scan: Addressing Load Imbalance in Large-Scale Image Registration Open

Marcin Copik, Tobias Grosser, Torsten Hoefler, Paolo Bientinesi, Benjamin Berkels · 2021

Computer science Biology Philosophy

Parallelism patterns (e.g., map or reduce) have proven to be effective tools for parallelizing high-performance applications. In this paper, we study the recursive registration of a series of electron microscopy images - a time-consuming a…

rFaaS: Enabling High Performance Serverless with RDMA and Leases Open

Marcin Copik, Konstantin Taranov, Alexandru Calotoiu, Torsten Hoefler · 2021

Computer science Economics

High performance is needed in many computing systems, from batch-managed supercomputers to general-purpose cloud platforms. However, scientific clusters lack elastic parallelism, while clouds cannot offer competitive costs for high-perform…

GraphMineSuite: Enabling High-Performance and Programmable Graph Mining Algorithms with Set Algebra Open

Maciej Besta, Zur Vonarburg-Shmaria, Yannick Schaffner, Leonardo Schwarz, Grzegorz Kwaśniewski, et al. · 2021

Computer science Business

We propose GraphMineSuite (GMS): the first benchmarking suite for graph mining that facilitates evaluating and constructing high-performance graph mining algorithms. First, GMS comes with a benchmark specification based on extensive lit…
