Skip to content

Latest commit

 

History

History
58 lines (45 loc) · 3.96 KB

README.md

File metadata and controls

58 lines (45 loc) · 3.96 KB

Byzantine Fault Tolerant CRDTs

This work is mainly inspired by implementing Martin Kleppmann's 2022 paper on Making CRDTs Byzantine Fault Tolerant1 on top of a simplified Automerge implementation.

The goal is to show a working prototype that demonstrated in simple code the ideas behind

  1. An Automerge-like CRDT
  2. How a primitive list CRDT can be composed to create complex CRDTs like JSON
  3. How to add Byzantine Fault Tolerance to arbitrary CRDTs

Unlike most other CRDT implementations, I leave out many performance optimizations that would make the basic algorithm harder to understand.

Check out the accompanying blog post for this project!

Benchmarks

Altough this implementation does not optimize for performance, it still nonetheless performs quite well.

Benchmarking happened on a 2019 Macbook Pro with a 2.6GHz i7. Numbers are compared to Automerge which report their performance benchmarks here

# Ops Raw String (JS) Ours (basic) Ours (BFT) Automerge (JS) Automerge (Rust)
10k n/a 0.081s 1.793s 1.6s 0.047s
100k n/a 9.321s 38.842s 43.0s 0.597s
All (259k) 0.61s 88.610s 334.960s Out of Memory 1.780s
Memory 0.1MB 27.6MB 59.5MB 880MB 232.5MB

Flamegraph

To get some flamegraphs of the time graph on MacOS, run:

sudo cargo flamegraph --dev --root --bench speed

Further Work

This is mostly a learning/instructional project but there are a few places where performance improvements are obvious:

  1. This is backed by std::Vec which isn't great for random insert. Replace with a B-tree or something that provides better insert and find performance
    1. Diamond Types and Automerge (Rust) use a B-tree
    2. Yjs is backed by a doubly linked-list and caches last ~5-10 accessed locations (assumes that most edits happen sequentially; seeks are rare)
    3. (funnily enough, main peformance hit is dominated by find and not insert, see this flamegraph)
  2. Avoid calling find so many times. A few Automerge optimizations that were not implemented
    1. Use an index hint (especially for local inserts)
    2. Skipping the second find operation in integrate if sequence number is already larger
  3. Improve storage requirement. As of now, a single Op weighs in at over 168 bytes. This doesn't even fit in a single cache line!
  4. Implement 'transactions' for a group of changes that should be considered atomic.
    1. This would also speed up Ed25519 signature verification time by batching.
    2. For example, a peer might create an atomic 'transaction' that contains a bunch of changes.
  5. Currently, each character is a single op. Similar to Yjs, we can combine runs of characters into larger entities like what André, Luc, et al.2 suggest
  6. Implement proper persistence using SQLLite or something similar
  7. Compile the project to WASM and implement a transport layer so it can be used in browser. Something similar to Yjs' WebRTC Connector could work.

Acknowledgements

Thank you to Nalin Bhardwaj for helping me with my cryptography questions and Martin Kleppmann for his teaching materials and lectures which taught me a significant portion of what I've learned about distributed systems and CRDTs.

Footnotes

  1. Kleppmann, Martin. "Making CRDTs Byzantine Fault Tolerant." Proceedings of the 9th Workshop on Principles and Practice of Consistency for Distributed Data. 2022.

  2. André, Luc, et al. "Supporting adaptable granularity of changes for massive-scale collaborative editing." 9th IEEE International Conference on Collaborative Computing: Networking, Applications and Worksharing. IEEE, 2013.