When I was first introduced to Oracle years ago, I was surprised at how much it acted like an operating system inside. Once connected to the shell, you could find processes running, examine memory through data dictionary queries, and so forth. Tuning it involved looking at disk I/O patterns and memory usage and tightening your loops; in the SQL world that meant indexing & reviewing plans for joins carefully.
MySQL & Postgres carry some of those same attributes. Even though they are user processes and the OS controls the lower levels, you would typically deploy them on their own servers and give them all the memory & disk I/O you could, because they were essentially their own operating systems.
It's not clear to me how building a database "like an operating system" is something new. What did look new to me was the consistency-checking tool. That is super important, because in a distributed environment you're bound to have drift.
Also, I'd like to see how they managed to get full ACID compliance. Fast network interfaces are great, but there is still some latency, which means hiccups when multiple nodes try to update the same row.
I agree with you that it's not an original analogy; in fact I think you can safely say that most larger, performance-oriented systems end up looking more like an OS than not. There's such a large overlap between the problems that need to be solved and the resources available to distributed systems that I'd be more skeptical of an implementation that didn't lean heavily on the OS, or at least borrow its general patterns.
To answer your question about consistency, we have implemented a transaction resolution mechanism inspired by Calvin. The whitepaper contains a lot more detail, but the short answer is that we use a consistent, distributed transaction log to provide a global order of all read-write transactions, which are executed deterministically on the responsible data partitions. This allows FaunaDB to provide a guarantee of strict-serializability for read-write transactions. Read-only transactions are by default a bit more relaxed and provide a hard guarantee of serializability in order to avoid global coordination overhead.
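Roughly, the mechanism looks something like the toy sketch below. This is my own illustration of the Calvin-style idea (names like Txn, submit, and replay are mine, not anything from FaunaDB's actual code): read-write transactions are sequenced in a single global log first, and every node then applies the ones that touch its keys in that same order, so replicas converge without cross-node row locking at execution time.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Set

@dataclass
class Txn:
    keys: Set[str]                    # keys the transaction declares up front
    apply: Callable[[dict], None]     # deterministic logic over current values

global_log: List[Txn] = []            # stands in for the replicated, consistent log

def submit(txn: Txn) -> int:
    """Sequence a transaction; its position in the log is its global order."""
    global_log.append(txn)
    return len(global_log) - 1

def replay(owned_keys: Set[str], store: dict) -> None:
    """A node executes, in log order, only the transactions touching keys it owns."""
    for txn in global_log:
        if txn.keys & owned_keys:
            txn.apply(store)

# Two replicas of the same partition replay the same log and converge,
# because the log order is the only order that matters.
submit(Txn({"a"}, lambda s: s.update(a=s.get("a", 0) + 1)))
submit(Txn({"a", "b"}, lambda s: s.update(b=s.get("a", 0) * 2)))

replica_1, replica_2 = {}, {}
replay({"a", "b"}, replica_1)
replay({"a", "b"}, replica_2)
assert replica_1 == replica_2 == {"a": 1, "b": 2}
```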
The key difference from conventional RDBMS systems is that transactions are deterministic. When interacting with a traditional RDBMS, a client opens a transaction with a BEGIN statement, then does reads and writes interactively at its own pace. In contrast, in Calvin-inspired systems clients send their transactions as one unit, written in a severely restricted query language (disclaimer: not a FaunaDB engineer).
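To make the contrast concrete, here's a small sketch of the two client shapes; the function and key names are hypothetical and this is not FaunaDB's driver API, just the general "send the whole transaction at once" idea:

```python
# Interactive RDBMS style -- the server waits on the client between statements,
# holding locks while the client thinks:
#   BEGIN;
#   SELECT balance FROM accounts WHERE id = 1;
#   -- client computes the new balance at its own pace
#   UPDATE accounts SET balance = 90 WHERE id = 1;
#   COMMIT;
#
# Deterministic style -- the whole transaction is a pure function of the store,
# shipped in one request, so any replica can replay it and get the same result.

def withdraw_10(store: dict) -> dict:
    """Entire transaction logic declared up front; no round trips mid-transaction."""
    new = dict(store)
    new["accounts/1"] = store.get("accounts/1", 100) - 10
    return new

# Replaying against the same prior state always yields the same new state.
assert withdraw_10({"accounts/1": 100}) == withdraw_10({"accounts/1": 100})
```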
Yes, big databases can bypass the OS scheduler (CPU pinning) and file caching (O_DIRECT) so they can perform workload-aware optimizations that beat the general-purpose OS mechanisms.
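For anyone curious what those knobs look like, here's a minimal Linux-only sketch (the file path is hypothetical, and this isn't taken from any particular database's source): pin the process to one core so the scheduler stops migrating it, and open a file with O_DIRECT so reads skip the OS page cache and the database can manage its own cache.

```python
import os
import mmap

PATH = "/var/tmp/datafile"   # hypothetical data file
BLOCK = 4096                 # O_DIRECT I/O must be aligned to the block size

os.sched_setaffinity(0, {0})            # run this process on CPU core 0 only

with open(PATH, "wb") as f:             # set up one block of test data
    f.write(b"\x00" * BLOCK)

fd = os.open(PATH, os.O_RDONLY | os.O_DIRECT)
buf = mmap.mmap(-1, BLOCK)              # mmap memory is page-aligned, as O_DIRECT requires
os.readv(fd, [buf])                     # read one raw block, bypassing the page cache
os.close(fd)
```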