Martin Kruliš
YOU?
Author Swipe
View article: Layout-Agnostic MPI Abstraction for Distributed Computing in Modern C++
Layout-Agnostic MPI Abstraction for Distributed Computing in Modern C++ Open
View article: Tutoring LLM into a Better CUDA Optimizer
Tutoring LLM into a Better CUDA Optimizer Open
View article: Abstractions for C++ code optimizations in parallel high-performance applications
Abstractions for C++ code optimizations in parallel high-performance applications Open
Many computational problems consider memory throughput a performance bottleneck, especially in the domain of parallel computing. Software needs to be attuned to hardware features like cache architectures or concurrent memory banks to reach…
View article: Tackling Students' Coding Assignments with LLMs
Tackling Students' Coding Assignments with LLMs Open
View article: Pure C++ Approach to Optimized Parallel Traversal of Regular Data Structures
Pure C++ Approach to Optimized Parallel Traversal of Regular Data Structures Open
Many computational problems consider memory throughput a performance bottleneck. The problem becomes even more pronounced in the case of parallel platforms, where the ratio between computing elements and memory bandwidth shifts towards com…
View article: GPU-acceleration of neighborhood-based dimensionality reduction algorithm EmbedSOM
GPU-acceleration of neighborhood-based dimensionality reduction algorithm EmbedSOM Open
Dimensionality reduction methods have found vast applications as visualization tools in diverse areas of science. Although many different methods exist, their performance is often insufficient for providing quick insight into many contempo…
View article: Online ML Self-adaptation in Face of Traps
Online ML Self-adaptation in Face of Traps Open
Online machine learning (ML) is often used in self-adaptive systems to strengthen the adaptation mechanism and improve the system utility. Despite such benefits, applying online ML for self-adaptation can be challenging, and not many paper…
View article: Efficient Gpu-Accelerated Parallel Cross-Correlation
Efficient Gpu-Accelerated Parallel Cross-Correlation Open
View article: Modeling Machine Learning Concerns in Collective Adaptive Systems
Modeling Machine Learning Concerns in Collective Adaptive Systems Open
View article: Astute Approach to Handling Memory Layouts of Regular Data Structures
Astute Approach to Handling Memory Layouts of Regular Data Structures Open
View article: Scalable semi-supervised dimensionality reduction with GPU-accelerated EmbedSOM
Scalable semi-supervised dimensionality reduction with GPU-accelerated EmbedSOM Open
Dimensionality reduction methods have found vast application as visualization tools in diverse areas of science. Although many different methods exist, their performance is often insufficient for providing quick insight into many contempor…
View article: Towards Model-driven Fuzzification of Adaptive Systems Specification
Towards Model-driven Fuzzification of Adaptive Systems Specification Open
View article: Ensemble-Based Modeling Abstractions for Modern Self-optimizing Systems
Ensemble-Based Modeling Abstractions for Modern Self-optimizing Systems Open
View article: Towards fuzzification of adaptation rules in self-adaptive architectures
Towards fuzzification of adaptation rules in self-adaptive architectures Open
In this paper, we focus on exploiting neural networks for the analysis and planning stage in self-adaptive architectures. The studied motivating cases in the paper involve existing (legacy) self-adaptive architectures and their adaptation …
View article: GPU-Accelerated Mahalanobis-Average Hierarchical Clustering Analysis
GPU-Accelerated Mahalanobis-Average Hierarchical Clustering Analysis Open
View article: Towards Efficient Locality Aware Parallel Data Stream Processing
Towards Efficient Locality Aware Parallel Data Stream Processing Open
View article: Data Preprocessing of eSport Game Records - Counter-Strike: Global Offensive
Data Preprocessing of eSport Game Records - Counter-Strike: Global Offensive Open
View article: Towards Efficient Locality Aware Parallel Data Stream Processing
Towards Efficient Locality Aware Parallel Data Stream Processing Open
Parallel data processing and parallel streaming systems become quite popular. They are employed in various domains such as real-time signal processing, OLAP database systems, or high performance data extraction. One of the key components o…
View article: A Survey of CUDA-based Multidimensional Scaling on GPU Architecture
A Survey of CUDA-based Multidimensional Scaling on GPU Architecture Open
The need to analyze large amounts of multivariate data raises the fundamental problem of dimensionality reduction which is defined as a process of mapping data from high-dimensional space into low-dimensional. One of the most popular metho…