Do we really need a third Apache project for columnar data? (2017)

Apache Parquet and Apache ORC have become a popular file formats for storing data in the Hadoop ecosystem. Their primary value proposition ... | Continue reading


@dbmsmusings.blogspot.com | 3 years ago

Slog: Cheating the low-latency vs. strict serializability tradeoff

This post provides an overview of a new technology from my lab that was recently published in VLDB 2019 . In short: SLOG is a geographicall... | Continue reading


@dbmsmusings.blogspot.com | 4 years ago

An explanation of the difference between Isolation levels vs. Consistency levels

In several recent posts, we discussed two ways to trade off correctness for performance in database systems. In particular, I wrote two pos... | Continue reading


@dbmsmusings.blogspot.com | 4 years ago

Overview of Consistency Levels in Database Systems

Database systems typically give users the ability to trade off correctness for performance. We have spent the previous two posts in this s... | Continue reading


@dbmsmusings.blogspot.com | 4 years ago

The dangers of conditional consistency guarantees

The ease of writing an application on top of infrastructure that guarantees consistency can not be overstated . It’s just so much easier to ... | Continue reading


@dbmsmusings.blogspot.com | 4 years ago

Correctness Anomalies Under Serializable Isolation

Most database systems support multiple isolation levels that enable their users to trade off exposure to various types of application a... | Continue reading


@dbmsmusings.blogspot.com | 4 years ago

Correctness Anomalies Under Serializable Isolation

Most database systems support multiple isolation levels that enable their users to trade off exposure to various types of application a... | Continue reading


@dbmsmusings.blogspot.com | 4 years ago

DBMS Musings: Apache Arrow vs. Parquet and Orc

Apache Parquet and Apache ORC have become a popular file formats for storing data in the Hadoop ecosystem. Their primary value proposition ... | Continue reading


@dbmsmusings.blogspot.com | 4 years ago

Introduction to Transaction Isolation Levels

For decades, database systems have given their users multiple isolation levels to choose from, ranging from some flavor of “serializability... | Continue reading


@dbmsmusings.blogspot.com | 4 years ago

Introduction to Transaction Isolation Levels

For decades, database systems have given their users multiple isolation levels to choose from, ranging from some flavor of “serializability... | Continue reading


@dbmsmusings.blogspot.com | 4 years ago

It’s Time to Move on from Two Phase Commit

The two-phase commit protocol (2PC) has been used in enterprise software systems for over three decades . It has been an an incredibly impa... | Continue reading


@dbmsmusings.blogspot.com | 5 years ago

Partitioned consensus and its impact on Spanner’s latency

In a post that I published in September , I described two primary approaches for performing consensus in distributed systems, and how the c... | Continue reading


@dbmsmusings.blogspot.com | 5 years ago

Distributed consistency at scale: Spanner vs. Calvin

Introduction In 2012, two research papers were published that described the design of geographically replicated, consistent, ACID complia... | Continue reading


@dbmsmusings.blogspot.com | 5 years ago

NewSQL databases fail to guarantee consistency and I blame Spanner

The CAP theorem famously states that it is impossible to guarantee both consistency and availability in the event of a network partition. S... | Continue reading


@dbmsmusings.blogspot.com | 5 years ago