Peter A. Dinda
TRUSTCHECKPOINTS: Time Betrays Malware for Unconditional Software Root of Trust Open
Modern IoT and embedded platforms must start execution from a known trusted state to thwart malware, ensure secure firmware updates, and protect critical infrastructure. Current approaches to establish a root of trust depend on secret keys…
Efficient Video Redaction at the Edge: Human Motion Tracking for Privacy Protection Open
Computationally efficient, camera-based, real-time human position tracking on low-end, edge devices would enable numerous applications, including privacy-preserving video redaction and analysis. Unfortunately, running most deep neural netw…
CAMP: Compiler and Allocator-based Heap Memory Protection Open
The heap is a critical and widely used component of many applications. Due to its dynamic nature, combined with the complexity of heap management algorithms, it is also a frequent target for security exploits. To enhance the heap's securit…
Compiling Loop-Based Nested Parallelism for Irregular Workloads Open
Modern programming languages offer special syntax and semantics for logical fork-join parallelism in the form of parallel loops, allowing them to be nested, e.g., a parallel loop within another parallel loop. This expressiveness comes at a…
TrackFM: Far-out Compiler Support for a Far Memory World Open
Large memory workloads with favorable locality of reference can benefit by extending the memory hierarchy across machines. Systems that enable such far memory configurations can improve application performance and overall memory utilizatio…
Task parallel assembly language for uncompromising parallelism Open
Achieving parallel performance and scalability involves making compromises between parallel and sequential computation. If not contained, the overheads of parallelism can easily outweigh its benefits, sometimes by orders of magnitude. Toda…
Paths to Fast Barrier Synchronization on the Node Open
Synchronization primitives like barriers heavily impact the performance of parallel programs. As core counts increase and granularity decreases, the value of enabling fast barriers increases. Through the evaluation of the performance of a …
Unconventional Parallelization of Nondeterministic Applications Open
The demand for thread-level parallelism (TLP) on commodity processors is endless as it is essential for gaining performance and saving energy. However, TLP in today's programs is limited by dependences that must be satisfied at run time. W…
Multiverse: Easy Conversion of Runtime Systems into OS Kernels via Automatic Hybridization Open
The hybrid runtime (HRT) model offers a path towards high performance and efficiency. By integrating the OS kernel, runtime, and application, an HRT allows the runtime developer to leverage the full feature set of the hardware and speciali…
Enabling Hybrid Parallel Runtimes Through Kernel and Virtualization Support Open
In our hybrid runtime (HRT) model, a parallel runtime system and the application are together transformed into a specialized OS kernel that operates entirely in kernel mode and can thus implement exactly its desired abstractions on top of …