Skip to content

A datastore aiming at linear scalability up to the yottabyte range. Inspired by dynamo and cassandra.

License

Notifications You must be signed in to change notification settings

yottaStore/documentation

Repository files navigation

Introduction

Yotta Store is a next generation storage system aiming to scale out to the yotta byte range and scale up to millions of concurrent read and writes per record. The goal is to have two orders of magnitude more throughput than DynamoDB, dollar per dollar, while maintaining a sub-ms latency. Check our benchmarks

Yotta Store is built on top of a 512 bit distributed machine, with a 4kib word size. We try to design a system which can exploit the capabilities of modern hardware and software, like NVMe disks or io_uring. Read more in the docs

Main features

  • Linear scalability, up to 10^9 nodes and 10 yotta bytes of addressable space.
  • Anti fragility, the multi tenant setup increase reliability and availability with load.
  • Self configuring and optimizing, thanks to machine learning
  • Strong consistency guarantees, aiming at sub ms latency.
  • Cheap transactions and indexes, at around o(n).
  • Storage decoupled from compute with a serverless architecture.
  • Two orders of magnitude faster than DynamoDB, dollar per dollar.

Techniques used

Yotta Store does not create any new technique, but uses existing ones in a novel combination:

Problem Solution Description
Partitioning and replication ReBaR: Rendezvous Based Routing A generalization of consistent hashing, for agreeing k
choices in a stable way and with minimal information exchange
Highly available read and writes YottaFS A wait free userspace filesystem, designed around NVMe devices and CRDTs.
Hot records YottaSelf Hot records can be sandboxed in dedicated partitions.
Transactions Modified Warp Thanks to the modified warp algorithm no distributed consensus is needed.
Indexes, queries across keys Partitioned indexes Indexes are sharded and lazily built on the same partition seeing the changes.
Membership and failure detection Gossip agreement protocol Thanks to GAP and ReBaR the expected cost is o(log(n)) deterministically.

Inspirations

About

A datastore aiming at linear scalability up to the yottabyte range. Inspired by dynamo and cassandra.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published