Explanipedia

Scalar Vector Runahead: Removing the Shackles of Indirect Memory Chains on In-Order Cores Open

Jaime Roelandts, Ajeya Naithani, Sam Ainsworth, Timothy M. Jones, Lieven Eeckhout · 2025

Modern processors often face the memory wall as a bottleneck, an exacerbated problem for stall-on-use in-order cores. Despite this limitation, there is growing demand for energy-efficient in-order cores due to privacy and sustainability co…

The Architectural Sustainability Indicator Open

Jaime Roelandts, Ajeya Naithani, Lieven Eeckhout · 2025

Computer science Business Engineering

Computing devices are responsible for a significant fraction of the world's total carbon footprint. Designing sustainable systems is a challenging endeavor because of the huge design space, the complex objective function, and the inherent …

Scalar Vector Runahead Open

Jaime Roelandts, Ajeya Naithani, Sam Ainsworth, Timothy M. Jones, Lieven Eeckhout · 2024

Computer science Physics Mathematics

Modern graph and database processing typically takes place on high-end servers in data centers. However, with growing concerns of data privacy, trustworthiness, and all-time connectivity, there has been a shift toward increased analytics p…

Decoupled Vector Runahead for Prefetching Nested Memory-Access Chains Open

Ajeya Naithani, Jaime Roelandts, Sam Ainsworth, Timothy M. Jones, Lieven Eeckhout · 2024

Computer science

Decoupled vector runahead (DVR) exploits massive amounts of memory-level parallelism to improve the performance of applications that feature indirect memory accesses by dynamically inferring loop bounds at runtime, recognizing striding loa…

Decoupled Vector Runahead Open

Ajeya Naithani, Jaime Roelandts, Sam Ainsworth, Timothy M. Jones, Lieven Eeckhout · 2023

Computer science Mathematics

We present Decoupled Vector Runahead (DVR), an in-core prefetching technique, executing separately to the main application thread, that exploits massive amounts of memory-level parallelism to improve the performance of applications featuri…

Vector Runahead for Indirect Memory Accesses Open

Ajeya Naithani, Sam Ainsworth, Timothy M. Jones, Lieven Eeckhout · 2022

Computer science

Vector runahead delivers extremely high memory-level parallelism even for the chains of dependent memory accesses with complex intermediate address computation, which conventional runahead techniques fundamentally cannot handle and, theref…

The Forward Slice Core: A High-Performance, Yet Low-Complexity Microarchitecture Open

Kartik Lakshminarasimhan, Ajeya Naithani, Josué Feliu, Lieven Eeckhout · 2022

Computer science

Superscalar out-of-order cores deliver high performance at the cost of increased complexity and power budget. In-order cores, in contrast, are less complex and have a smaller power budget, but offer low performance. A processor architectur…

VMT: Virtualized Multi-Threading for Accelerating Graph Workloads on Commodity Processors Open

Josué Feliu, Ajeya Naithani, Julio Sahuquillo, Salvador Petit, Moinuddin K. Qureshi , et al. · 2021

Computer science Physics

[EN] Modern-day graph workloads operate on huge graphs through pointer chasing which leads to high last-level cache (LLC) miss rates and limited memory-level parallelism (MLP). Simultaneous Multi-Threading (SMT) effectively hides the memor…

Vector Runahead Open

Ajeya Naithani, Sam Ainsworth, Timothy M. Jones, Lieven Eeckhout · 2021

Computer science

The memory wall places a significant limit on performance for many modern workloads. These applications feature complex chains of dependent, indirect memory accesses, which cannot be picked up by even the most advanced microarchitectural p…

The Forward Slice Core Microarchitecture Open

Kartik Lakshminarasimhan, Ajeya Naithani, Josué Feliu, Lieven Eeckhout · 2020

Computer science Economics

Superscalar out-of-order cores deliver high performance at the cost of increased complexity and power budget. In-order cores, in contrast, are less complex and have a smaller power budget, but offer low performance. A processor architectur…

Precise Runahead Execution Open

Ajeya Naithani, Josué Feliu, Almutaz Adileh, Lieven Eeckhout · 2020

Computer science Engineering

Runahead execution improves processor performance by accurately prefetching long-latency memory accesses. When a long-latency load causes the instruction window to fill up and halt the pipeline, the processor enters runahead mode and keeps…

Precise Runahead Execution Open

Ajeya Naithani, Josué Feliu, Almutaz Adileh, Lieven Eeckhout · 2019

Computer science Engineering Biology

© 2019 IEEE. Personal use of this material is permitted. Permissíon from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertisíng or promotional purposes, cre…

Optimizing Soft Error Reliability Through Scheduling on Heterogeneous Multicore Processors Open

Ajeya Naithani, Stijn Eyerman, Lieven Eeckhout · 2017

Computer science Engineering Physics

Reliability to soft errors is an increasingly important issue as technology continues to shrink. In this paper, we show that applications exhibit different reliability characteristics on big, high-performance cores versus small, power-effi…

Ajeya Naithani YOU? Author Swipe