Explanipedia

Optimal Scheduling Algorithms for LLM Inference: Theory and Practice Open

Gustavo de Veciana · 2025

With the growing use of Large Language Model (LLM)-based tools like ChatGPT, Perplexity, and Gemini across industries, there is a rising need for efficient LLM inference systems. These systems handle requests with a unique two-phase comput…

Batching-Aware Joint Model Onloading and Offloading for Hierarchical Multi-Task Inference Open

Seohyeon Cha, Kevin Chan, Gustavo de Veciana, Haris Vikalo · 2025

The growing demand for intelligent services on resource-constrained edge devices has spurred the development of collaborative inference systems that distribute workloads across end devices, edge servers, and the cloud. While most existing …

Importance Sampling via Score-based Generative Models Open

Heasung Kim, T. J. Lee, Hyeji Kim, Gustavo de Veciana · 2025

Computer science Mathematics

Importance sampling, which involves sampling from a probability density function (PDF) proportional to the product of an importance weight function and a base PDF, is a powerful technique with applications in variance reduction, biased or …

Ten Ways in which Virtual Reality Differs from Video Streaming Open

Gustavo de Veciana, Sonia Fahmy, George Kesidis, Voicu Popescu · 2024

Computer science

Virtual Reality (VR) applications have a number of unique characteristics that set them apart from traditional video streaming. These characteristics have major implications on the design of VR rendering, adaptation, prefetching, caching, …

Scheduling "Last Minute" Updates for Timely Decision-Making Open

Jean Abou Rahal, Gustavo de Veciana · 2023

Computer science Mathematics

We consider a setting where requests for updates regarding time-varying processes are required prior to making a sequence of decisions. Each request has a finite length time window during which the update should be received. The end of the…

Managing Edge Offloading for Stochastic Workloads with Deadlines Open

Agrim Bari, Gustavo de Veciana, Kerstin Johnsson, Alexander Pyattaev · 2023

Computer science Chemistry

Increasing demand for computationally intensive jobs on mobile devices is driving interest in computation offloading to the edge/cloud servers. This paper presents a comprehensive framework for managing offloading of stochastic and heterog…

Authors Open

Mohamed Mosc, Ahmed Abdelghaffar, Mohamed‐Slim Alouini, Antonia Arvanitaki, Umer Ashraf, et al. · 2023

Computer science Biology

To Re-transmit or Not to Re-transmit for Freshness WMOSC.4 635 Age-Based Cache Updating Under Timestomping Bang, Ban S4.4 111 An improved genetic algorithm for bi-level multi-objective Q-coverage in directional sensor networks Baras, John …

MOHAWK: Mobility and Heterogeneity-Aware Dynamic Community Selection for Hierarchical Federated Learning Open

Allen-Jasmin Farcas, Myungjin Lee, Ramana Rao Kompella, Hugo Latapie, Gustavo de Veciana, et al. · 2023

Computer science Engineering Mathematics

The recent developments in Federated Learning (FL) focus on optimizing the learning process for data, hardware, and model heterogeneity. However, most approaches assume all devices are stationary, charging, and always connected to the Wi-F…

Constrained Network Slicing Games: Achieving Service Guarantees and Network Efficiency Open

Jiaxiao Zheng, Albert Banchs, Gustavo de Veciana · 2023

Computer science Business

Network slicing is a key capability for next generation mobile networks. It enables infrastructure providers to cost effectively customize logical networks over a shared infrastructure. A critical component of network slicing is resource a…

Network Adaptive Federated Learning: Congestion and Lossy Compression Open

Parikshit Hegde, Gustavo de Veciana, Aryan Mokhtari · 2023

Computer science Materials science

In order to achieve the dual goals of privacy and learning across distributed data, Federated Learning (FL) systems rely on frequent exchanges of large files (model updates) between a set of clients and the server. As such FL systems are e…

Online learning for multi-agent based resource allocation in weakly coupled wireless systems Open

Jianhan Song, Gustavo de Veciana, Sanjay Shakkottai · 2022

Computer science Physics

We propose and evaluate a learning-based framework to address multi-agent resource allocation in coupled wireless systems. In particular, we consider multiple agents (e.g., base stations, access points, etc.) that choose amongst a set of r…

Performance and efficiency tradeoffs in blockchain overlay networks Open

Parikshit Hegde, Gustavo de Veciana · 2022

Computer science Physics Medicine

Underlying blockchain's scalability and performance is a Peer-to-Peer (P2P) overlay network and protocols for relaying blocks and transactions among participating nodes. In this work, we model and perform a systematic analysis of blockchai…

Federated Learning Under Intermittent Client Availability and Time-Varying Communication Constraints Open

Mónica Ribero, Haris Vikalo, Gustavo de Veciana · 2022

Computer science Psychology Economics

Federated learning systems facilitate training of global models in settings where potentially heterogeneous data is distributed across a large number of clients. Such systems operate in settings with intermittent client availability and/or…

Joint Scheduling of URLLC and eMBB Traffic in 5G Wireless Networks Open

Arjun Anand, Gustavo de Veciana, Sanjay Shakkottai · 2020

Computer science Mathematics

Emerging 5G systems will need to efficiently support both enhanced mobile broadband traffic (eMBB) and ultra-low-latency communications (URLLC) traffic. In these systems, time is divided into slots which are further sub-divided into minisl…

Book-Ahead & Supply Management for Ridesourcing Platforms Open

Cesar N. Yahia, Gustavo de Veciana, Stephen D. Boyles, Jean Abou Rahal, Michael Stecklein · 2020

Computer science Engineering Business

Ridesourcing platforms recently introduced the "schedule a ride" service where passengers may reserve (book-ahead) a ride in advance of their trip. Reservations give platforms precise information that describes the start time and locatio…

Performance Analysis of RSU-based Multihomed Multilane Vehicular Networks Open

Saadallah Kassir, Pablo Caballero, Gustavo de Veciana, Nannan Wang, Xi Wang, et al. · 2020

Computer science

Motivated by the potentially high downlink traffic demands of commuters in future autonomous vehicles, we study a network architecture where vehicles use Vehicle-to-Vehicle (V2V) links to form relay network clusters, which in turn use Vehi…

Constrained Network Slicing Games: Achieving service guarantees and network efficiency Open

Jiaxiao Zheng, Gustavo de Veciana, Albert Banchs · 2020

Computer science Business Economics

Network slicing is a key capability for next generation mobile networks. It enables one to cost effectively customize logical networks over a shared infrastructure. A critical component of network slicing is resource allocation, which need…

Resource Allocation for Network Slicing in Mobile Networks Open

Albert Banchs, Gustavo de Veciana, Vincenzo Sciancalepore, Xavier Costa‐Pérez · 2020

Computer science Engineering Biology

This paper provides a survey of resource allocation for network slicing. We focus on two classes of existing solutions: (i) reservation-based approaches, which allocate resources on a reservation basis, and (ii) share-based approaches, whi…

Progressive Stochastic Greedy Sparse Reconstruction and Support Selection Open

Abolfazl Hashemi, Haris Vikalo, Gustavo de Veciana · 2019

Mathematics Computer science

Sparse reconstruction and sparse support selection, i.e., the tasks of inferring an arbitrary $m$-dimensional sparse vector $\mathbf{x}$ having $k$ nonzero entries from $n$ measurements of linear combinations of its components, are often e…

Stochastic-Greedy++: Closing the Optimality Gap in Exact Weak Submodular Maximization Open

Gustavo de Veciana, Abolfazl Hashemi, Haris Vikalo · 2019

Mathematics Computer science

Many problems in discrete optimization can be formulated as the task of maximizing a monotone and weak submodular function subject to a cardinality constraint. For such problems, a simple greedy algorithm is guaranteed to find a solution w…

Performance-Complexity Tradeoffs in Greedy Weak Submodular Maximization with Random Sampling Open

Abolfazl Hashemi, Haris Vikalo, Gustavo de Veciana · 2019

Mathematics Computer science

Many problems in signal processing and machine learning can be formalized as weak submodular optimization tasks. For such problems, a simple greedy algorithm (Greedy) is guaranteed to find a solution achieving the objective with a…

Analysis of Data Harvesting by Unmanned Aerial Vehicles Open

Chang‐Sik Choi, François Baccelli, Gustavo de Veciana · 2019

Computer science Mathematics Biology

Modeling and Optimization of Human-machine Interaction Processes via the Maximum Entropy Principle Open

Jiaxiao Zheng, Gustavo de Veciana · 2019

Computer science Mathematics Physics

We propose a data-driven framework to enable the modeling and optimization of human-machine interaction processes, e.g., systems aimed at assisting humans in decision-making or learning, work-load allocation, and interactive advertising. T…

Modeling and Analysis of Data Harvesting Architecture based on Unmanned Aerial Vehicles Open

Chang‐Sik Choi, François Baccelli, Gustavo de Veciana · 2019

Computer science Engineering Biology

This paper explores an emerging wireless Internet-of-things (IoT) architecture based on unmanned aerial vehicles (UAVs). We consider a network where a fleet of UAVs at a fixed altitude flies on planned trajectories and IoT devices on the g…

Network Slicing Games: Enabling Customization in Multi-Tenant Mobile Networks Open

Pablo Caballero, Albert Banchs, Gustavo de Veciana, Xavier Costa‐Pérez · 2019

Computer science Mathematics Economics

Network slicing to enable resource sharing among multiple tenants (network operators and/or services) is considered a key functionality for next generation mobile networks. This paper provides an analysis of a well-known model for resourc…
