Laxmi N. Bhuyan
Improving Energy Saving of One-Sided Matrix Decompositions on CPU-GPU Heterogeneous Systems
One-sided dense matrix decompositions (e.g., Cholesky, LU, and QR) are key components of scientific computing in many different fields. Although their design has been highly optimized for modern processors, they still consume a conside…
GreenMD: Energy-efficient Matrix Decomposition on Heterogeneous Multi-GPU Systems
The current trend of performance growth in HPC systems is accompanied by a massive increase in energy consumption. In this article, we introduce GreenMD, an energy-efficient framework for heterogeneous systems for LU factorization utilizin…
Variable metaplastic entities in pleomorphic adenoma a review of a rare case report with a note on its significance
Pleomorphic adenoma is the most common benign salivary gland neoplasm, principally affecting the parotid gland among the major salivary glands and the palate among the minor salivary glands. The term pleomorphic is assigned due to its varied histopatholo…
Impression Cytology's Reliability as an Effective Method for Ophthalmic Neoplasm Detection
Background: The current investigation was intended to evaluate the precision of impression cytology and tissue histology in the detection of ocular surface neoplasia. Materials and Methods: We examined the histories of patients detected wi…
SmartWatch
Despite advances in network security, attacks targeting mission critical systems and applications remain a significant problem for network and datacenter providers. Existing telemetry platforms detect volumetric attacks at terabit scales u…
PAVER
The massive parallelism present in GPUs comes at the cost of reduced L1 and L2 cache sizes per thread, leading to serious cache contention problems such as thrashing. Hence, the data access locality of an application should be considered d…
Swan
The service quality of web search depends considerably on the request tail latency from Index Serving Nodes (ISNs), prompting data centers to operate them at low utilization and wasting server power. ISNs can be made more energy efficient …
Slumber
Leakage power dissipation has become one of the major concerns with technology scaling. The GPGPU register file has grown in size over the last decade in order to support the parallel execution of thousands of threads. Given that each thre…
SAOU
The current trend of ever-increasing performance in scientific applications comes with tremendous growth in energy consumption. In this paper, we present a framework for GPU applications, which reduces energy consumption in GPUs through Sa…
GreenMM
The current trend of ever-increasing performance in scientific applications comes with tremendous growth in energy consumption. In this paper, we present the GreenMM framework for matrix multiplication, which reduces energy consumption in GPUs…
DREAM
Traffic consolidation has been proposed to save energy in data center networks. However, existing centralized traffic consolidation approaches focus on achieving optimal network energy saving, without considering the need to be responsive …
Juggler
Scientific applications with single instruction, multiple data (SIMD) computations show considerable performance improvements when run on today's graphics processing units (GPUs). However, the existence of data dependences across thread bl…
Wireframe
GPUs lack fundamental support for data-dependent parallelism and synchronization. While CUDA Dynamic Parallelism signals progress in this direction, many limitations and challenges still remain. This paper introduces Wireframe, a hardware-…
Efficient warp execution in presence of divergence with collaborative context collection
GPU's SIMD architecture is a double-edged sword confronting parallel tasks with control flow divergence. On the one hand, it provides a high performance yet power-efficient platform to accelerate applications via massive parallelism; howev…
Tumbler
Schedulers used by modern OSs (e.g., Oracle Solaris 11™ and GNU/Linux) balance load by balancing the number of threads in run queues of different cores. While this approach is effective for a single CPU multicore system, we show that it ca…