Patrick Iff
Higher-Order Graph Databases
Recent advances in graph databases (GDBs) have been driving interest in large-scale analytics, yet current systems fail to support higher-order (HO) interactions beyond first-order (one-hop) relations, which are crucial for tasks such as s…
Affordable AI Assistants with Knowledge Graph of Thoughts
Large Language Models (LLMs) are revolutionizing the development of AI assistants capable of performing diverse tasks across domains. However, current state-of-the-art LLM-driven agents face significant challenges, including high operation…
PlaceIT: Placement-based Inter-Chiplet Interconnect Topologies
2.5D integration technology is gaining traction as it copes with the exponentially growing design cost of modern integrated circuits. A crucial part of a 2.5D stacked chip is a low-latency and high-throughput inter-chiplet interconnect (IC…
Reasoning Language Models: A Blueprint
Reasoning language models (RLMs), also known as Large Reasoning Models (LRMs), such as OpenAI's o1 and o3, DeepSeek-R1, and Alibaba's QwQ, have redefined AI's problem-solving capabilities by extending LLMs with advanced reasoning mechanism…
Hardware Acceleration for Knowledge Graph Processing: Challenges & Recent Developments
Knowledge graphs (KGs) have attracted significant attention in recent years, particularly in the area of the Semantic Web, and are gaining popularity in other application domains such as data mining and search engines. Simultaneously, the…
Multi-Head RAG: Solving Multi-Aspect Problems with LLMs
Retrieval Augmented Generation (RAG) enhances the abilities of Large Language Models (LLMs) by enabling the retrieval of documents into the LLM context to provide more accurate and relevant responses. Existing RAG solutions do not focus on…
CheckEmbed: Effective Verification of LLM Solutions to Open-Ended Tasks
Large Language Models (LLMs) are transforming a wide range of domains, yet verifying their outputs remains a significant challenge, especially for complex open-ended tasks such as consolidation, summarization, and knowledge extraction. To …
Near-Optimal Wafer-Scale Reduce
Efficient Reduce and AllReduce communication collectives are a critical cornerstone of high-performance computing (HPC) applications. We present the first systematic investigation of Reduce and AllReduce on the Cerebras Wafer-Scale Engine …
RapidChiplet: A Toolchain for Rapid Design Space Exploration of Chiplet Architectures
Chiplet architectures are on the rise as they promise to overcome the scaling challenges of monolithic chips. A key component of such architectures is an efficient inter-chiplet interconnect (ICI). The ICI design space is huge as there are…
A High-Performance Design, Implementation, Deployment, and Evaluation of The Slim Fly Network
Novel low-diameter network topologies such as Slim Fly (SF) offer significant cost and power advantages over the established Fat Tree, Clos, or Dragonfly. To spearhead the adoption of low-diameter networks, we design, implement, deploy, an…
HexaMesh: Scaling to Hundreds of Chiplets with an Optimized Chiplet Arrangement
2.5D integration is an important technique to tackle the growing cost of manufacturing chips in advanced technology nodes. This poses the challenge of providing high-performance inter-chiplet interconnects (ICIs). As the number of chiplets…
Sparse Hamming Graph: A Customizable Network-on-Chip Topology
Chips with hundreds to thousands of cores require scalable networks-on-chip (NoCs). Customization of the NoC topology is necessary to reach the diverse design goals of different chips. We introduce sparse Hamming graph, a novel NoC topolog…
Neural Graph Databases
Graph databases (GDBs) enable processing and analysis of unstructured, complex, rich, and usually vast graph datasets. Despite the great significance of GDBs in both academia and industry, little effort has been made to integrate them …
ProbGraph: High-Performance and High-Accuracy Graph Mining with Probabilistic Set Representations
Important graph mining problems such as Clustering are computationally demanding. To significantly accelerate these problems, we propose ProbGraph: a graph representation that enables simple and fast approximate parallel graph mining with …
PolarFly: A Cost-Effective and Flexible Low-Diameter Topology
In this paper, we present PolarFly, a diameter-2 network topology based on the Erdős-Rényi family of polarity graphs from finite geometry. This is a highly scalable low-diameter topology that asymptotically reaches the Moore bound on the nu…