Daniel J. Abadi
AutoComp: Automated Data Compaction for Log-Structured Tables in Data Lakes
The proliferation of small files in data lakes poses significant challenges, including degraded query performance, increased storage costs, and scalability bottlenecks in distributed storage systems. Log-structured table formats (LSTs) suc…
Are Database System Researchers Making Correct Assumptions about Transaction Workloads?
Many recent papers have contributed novel concurrency control and transaction processing algorithms that start by making an assumption about the transaction workload submitted by an application, and yield high performance (sometimes by an …
Evaluating Learned Query Performance Prediction Models at LinkedIn: Challenges, Opportunities, and Findings
Recent advancements in learning-based query performance prediction models have demonstrated remarkable efficacy. However, these models are predominantly validated using synthetic datasets focused on cardinality or latency estimations. This…
The Cambridge Report on Database Research
On October 19 and 20, 2023, the authors of this report convened in Cambridge, MA, to discuss the state of the database research field, its recent accomplishments and ongoing challenges, and future directions for research and community enga…
Bullion: A Column Store for Machine Learning
The past two decades have witnessed significant success in applying columnar storage to data warehousing and analytics. However, the rapid growth of machine learning poses new challenges. This paper presents Bullion, a columnar storage sys…
FileScale
File systems that store metadata on a single machine or via a shared-disk abstraction face scalability challenges, especially in contexts demanding the management of billions of files. Recent work has shown that employing shared-nothing, d…
Detock: High Performance Multi-region Transactions at Scale
Many globally distributed data stores need to replicate data across large geographic distances. Since synchronously replicating data across such distances is slow, those systems with high consistency requirements often geo-partition data a…
The Seattle report on database research
Every five years, a group of the leading database researchers meet to reflect on their community's impact on the computing industry as well as examine current research challenges.
C5: Cloned Concurrency Control that Always Keeps Up
Asynchronously replicated primary-backup databases are commonly deployed to improve availability and offload read-only transactions. To both apply replicated writes from the primary and serve read-only transactions, the backups implement a…
BullFrog: Online Schema Evolution via Lazy Evaluation
BullFrog is a relational DBMS that supports single-step schema migrations, even those that are backwards incompatible, without downtime and without need for advance warning. When a schema migration is submitted, BullFrog initiates …
An overview of deterministic database systems
Deterministic database systems show great promise, but their deployment may require changes in the way developers interact with the database.
Design Principles for Scaling Multi-core OLTP Under High Contention
Although significant recent progress has been made in improving the multi-core scalability of high throughput transactional database systems, modern systems still fail to achieve scalable throughput for workloads involving frequent access …