This repository implements hashicorp/raft's FSM interface using BadgerDB as the underlying storage engine. BadgerDB was chosen for its combination of high performance and ease of use, which makes implementing an FSM straightforward (a minimal sketch follows the list below):
- Thread-Safe FSM Apply: Raft's single-threaded Apply design eliminates concerns about BadgerDB's transaction conflict detection
- High Performance I/O: BadgerDB provides excellent read and write performance capabilities
- Comprehensive Snapshot Support: Built-in backup and restore functionalities make FSM implementation straightforward
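To make these points concrete, here is a minimal, hypothetical sketch of an FSM built on those pieces. The `command` encoding and type names are illustrative assumptions rather than this repository's actual code; only the hashicorp/raft interfaces and the BadgerDB calls (`db.Update`, `db.Backup`, `db.Load`) are real APIs.

```go
package fsm

import (
	"encoding/json"
	"io"

	"github.com/dgraph-io/badger/v4"
	"github.com/hashicorp/raft"
)

// command is a hypothetical log-entry encoding used only for this sketch;
// the real project may serialize commands differently.
type command struct {
	Op    string `json:"op"` // "set" or "delete"
	Key   []byte `json:"key"`
	Value []byte `json:"value,omitempty"`
}

// BadgerFSM is a minimal FSM backed by a BadgerDB instance.
type BadgerFSM struct {
	db *badger.DB
}

// Apply is called by raft on a single goroutine, so no extra locking or
// transaction-conflict handling is needed around the Badger update.
func (f *BadgerFSM) Apply(l *raft.Log) interface{} {
	var cmd command
	if err := json.Unmarshal(l.Data, &cmd); err != nil {
		return err
	}
	return f.db.Update(func(txn *badger.Txn) error {
		switch cmd.Op {
		case "set":
			return txn.Set(cmd.Key, cmd.Value)
		case "delete":
			return txn.Delete(cmd.Key)
		}
		return nil
	})
}

// Snapshot leans on Badger's built-in backup stream.
func (f *BadgerFSM) Snapshot() (raft.FSMSnapshot, error) {
	return &badgerSnapshot{db: f.db}, nil
}

// Restore loads a backup stream back into the store. A real implementation
// would first clear existing keys (e.g. via DropAll) or reopen a fresh DB.
func (f *BadgerFSM) Restore(rc io.ReadCloser) error {
	defer rc.Close()
	return f.db.Load(rc, 16)
}

type badgerSnapshot struct {
	db *badger.DB
}

func (s *badgerSnapshot) Persist(sink raft.SnapshotSink) error {
	// Backup(w, 0) streams the full key space to the snapshot sink.
	if _, err := s.db.Backup(sink, 0); err != nil {
		sink.Cancel()
		return err
	}
	return sink.Close()
}

func (s *badgerSnapshot) Release() {}
```

Because raft invokes Apply on a single goroutine, each call can use a plain `db.Update` transaction without conflict handling, which is exactly the property called out in the first bullet above.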
- Strong consistency via the Raft consensus protocol
- Serializable and linearizable reads
- MVCC support
- Watch mechanism for key events
- SSD-friendly design and effectively unlimited FSM storage size, thanks to BadgerDB
- gRPC client interface
- Support for hashicorp/raft's BatchingFSM: with this feature enabled you get higher throughput, but log replication latency may increase and more raft logs that haven't yet been applied to the state machine can be lost on a crash; those entries are re-applied by WAL replay (see the sketch after this list).
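As a rough illustration of the BatchingFSM trade-off, the sketch below funnels a whole batch of raft logs into a single Badger write batch, so one flush covers many entries. It reuses the hypothetical `command` type from the FSM sketch above and is not the project's actual `ApplyBatch`.

```go
// ApplyBatch applies a whole slice of raft logs through one BadgerDB
// WriteBatch, trading a little latency for higher throughput.
func (f *BadgerFSM) ApplyBatch(logs []*raft.Log) []interface{} {
	resp := make([]interface{}, len(logs))
	wb := f.db.NewWriteBatch()
	defer wb.Cancel()

	for i, l := range logs {
		// Only command entries carry application data.
		if l.Type != raft.LogCommand {
			continue
		}
		var cmd command
		if err := json.Unmarshal(l.Data, &cmd); err != nil {
			resp[i] = err
			continue
		}
		switch cmd.Op {
		case "set":
			resp[i] = wb.Set(cmd.Key, cmd.Value)
		case "delete":
			resp[i] = wb.Delete(cmd.Key)
		}
	}
	// One flush persists the whole batch; if the process crashes before the
	// batch lands, the entries are replayed from the raft WAL.
	if err := wb.Flush(); err != nil {
		for i := range resp {
			if resp[i] == nil {
				resp[i] = err
			}
		}
	}
	return resp
}
```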
- Why write this project?
One of my applications uses memory as a level-2 cache, so I needed a general-purpose level-1 cache (its performance doesn't need to match in-memory databases like Redis, since most requests won't reach the database, but it should still be fast enough). However, I also required better consistency and reliable KV storage, which led to the creation of this project.
- Why does the FSM use such a lazy compaction and persistence strategy?
For higher write throughput; there is also no need for aggressive compaction or syncing. Both Raft and BadgerDB provide WAL mechanisms, giving us reliable data-safety guarantees. It's important to understand that Raft only ensures log replication across the cluster, while reliable storage depends entirely on the FSM implementation. To achieve higher write throughput, I've implemented a relatively aggressive Raft WAL persistence strategy (which still remains safe thanks to replication), while providing a more conservative FSM data compaction and persistence strategy. Since this project also uses BadgerDB as storage for the Raft logs and Raft metadata, I believe these two strategies are reasonable.
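One way such a lazy FSM persistence strategy could look in code is sketched below. The option values are hypothetical and not necessarily what this project ships with: synchronous writes are disabled (the raft WAL plus replication cover recovery of unsynced tail writes), and value-log GC runs on a slow timer instead of aggressively.

```go
package main

import (
	"log"
	"time"

	"github.com/dgraph-io/badger/v4"
)

// openFSMStore opens the FSM's BadgerDB with relaxed durability and lazy
// compaction. Hypothetical settings; the project's real options may differ.
func openFSMStore(kvDir string) (*badger.DB, error) {
	db, err := badger.Open(badger.DefaultOptions(kvDir).WithSyncWrites(false))
	if err != nil {
		return nil, err
	}

	// Lazy compaction: run value-log GC occasionally rather than aggressively.
	go func() {
		ticker := time.NewTicker(10 * time.Minute)
		defer ticker.Stop()
		for range ticker.C {
			if err := db.RunValueLogGC(0.5); err != nil && err != badger.ErrNoRewrite {
				log.Printf("value log GC: %v", err)
			}
		}
	}()
	return db, nil
}
```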
- When bootstrapping a cluster for the first time, you need to explicitly specify the leader peer address to join. For example, node2 and node3 use the --join-addr flag to point at node1:
./rbkv --grpc-addr=localhost:9501 --raft-addr=localhost:9601 --log-dir=/tmp/node1 --kv-dir=/tmp/node1
./rbkv --grpc-addr=localhost:9502 --join-addr=localhost:9501 --raft-addr=localhost:9602 --log-dir=/tmp/node2 --kv-dir=/tmp/node2
./rbkv --grpc-addr=localhost:9503 --join-addr=localhost:9501 --raft-addr=localhost:9603 --log-dir=/tmp/node3 --kv-dir=/tmp/node3
- Once a node has successfully joined a cluster, you don't need to specify the --join-addr flag again. Nodes will automatically rejoin the cluster using their stored state and can be restarted in any order:
./rbkv --grpc-addr=localhost:9502 --raft-addr=localhost:9602 --log-dir=/tmp/node2 --kv-dir=/tmp/node2
./rbkv --grpc-addr=localhost:9503 --raft-addr=localhost:9603 --log-dir=/tmp/node3 --kv-dir=/tmp/node3
./rbkv --grpc-addr=localhost:9501 --raft-addr=localhost:9601 --log-dir=/tmp/node1 --kv-dir=/tmp/node1
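Once the cluster is running, clients talk to any node over its gRPC address. The sketch below is only illustrative: the `pb` import path and the `KVClient`, `Put`, and `Get` stubs are placeholder names (the real generated client comes from this repository's proto definitions); only the grpc-go dialing calls are real API.

```go
package main

import (
	"context"
	"log"
	"time"

	"google.golang.org/grpc"
	"google.golang.org/grpc/credentials/insecure"

	// Hypothetical import path and generated package name for the KV service;
	// substitute the client generated from this repository's proto files.
	pb "example.com/rbkv/proto"
)

func main() {
	// Connect to any node's gRPC address, e.g. node1 from the example above.
	conn, err := grpc.Dial("localhost:9501",
		grpc.WithTransportCredentials(insecure.NewCredentials()))
	if err != nil {
		log.Fatal(err)
	}
	defer conn.Close()

	ctx, cancel := context.WithTimeout(context.Background(), time.Second)
	defer cancel()

	// KVClient, Put, Get and the request/response fields are placeholder names.
	client := pb.NewKVClient(conn)
	if _, err := client.Put(ctx, &pb.PutRequest{Key: []byte("hello"), Value: []byte("world")}); err != nil {
		log.Fatal(err)
	}
	resp, err := client.Get(ctx, &pb.GetRequest{Key: []byte("hello")})
	if err != nil {
		log.Fatal(err)
	}
	log.Printf("value = %s", resp.Value)
}
```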
- Support multi-key transactions
- Smarter client
- Command-line tool