PostgreSQL on ZFS on Kubernetes
I've seen the name PoZoL used somewhere for "PostgreSQL on ZFS on Linux", so this is PoZoK: the same thing, but on Kubernetes, with CNPG.
This is a repository of scripts, manifests, etc., to run/test/demo PostgreSQL on Kubernetes, with CNPG and OpenEBS LocalPV-ZFS.
The examples use Linode but are easily adaptable to pretty much any other cloud provider. I'm using Linode because they're affordable (way cheaper than AWS) and their Kubernetes clusters start quickly (way faster than AWS). I've also had success with DigitalOcean and Scaleway, and I run production PostgreSQL workloads (with CNPG and ZFS) on Hetzner. These are not endorsements; just some random person on the Internet telling you "trust me, sib, this works", so take it with a grain of salt and/or a pinch of spice!
Warning: if you change the node type (e.g. to use a smaller node), you will probably want to change the "size" parameter in the disk setup. That size is the size (in MB) of the ZFS partition, and it is subtracted from the system disk. This means that the system disk must be big enough to accommodate a "normal" system partition (say, anything between 10-100 GB, depending on the number and size of images, local volumes, etc. that you need) plus that ZFS partition.
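For instance (the numbers here are just an example; check your plan's actual disk size), on a node with a 160 GB disk you could give 100 GB to ZFS and leave about 60 GB to the system:
--node_pools.disks '[{"type": "raw", "size": 102400}]'   # 102400 MB ≈ 100 GB for the ZFS partition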
REGION=fr-par
CLUSTER_NAME=lke-db-$RANDOM
lin lke cluster-create --label $CLUSTER_NAME --region $REGION --k8s_version 1.31 \
--node_pools.type g6-standard-6 \
--node_pools.count 3 \
--node_pools.autoscaler.enabled true \
--node_pools.autoscaler.min 3 \
--node_pools.autoscaler.max 10 \
--node_pools.disks '[{"type": "raw", "size": 204800}]'
CLUSTER_ID=$(lin lke clusters-list --label $CLUSTER_NAME --json | jq .[].id)
# Wait until the LKE API starts serving the kubeconfig for the new cluster
while ! lin lke kubeconfig-view $CLUSTER_ID >/dev/null 2>&1; do
  sleep 10
done
lin lke kubeconfig-view $CLUSTER_ID --json | jq -r .[].kubeconfig | base64 -d > kubeconfig.$CLUSTER_ID
export KUBECONFIG=kubeconfig.$CLUSTER_ID
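At this point, you can check that the nodes are coming up:
kubectl get nodes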
helm upgrade --install --namespace zfs-system --create-namespace \
--repo https://openebs.github.io/zfs-localpv \
zfs zfs-localpv
# Note: if you're using Talos (and possibly other distros where `/home` might be immutable),
# you might want to add `--set zfsNode.encrKeysDir=/var/zfsencrkeys` to that Helm install.
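You can verify that the LocalPV-ZFS components are running before moving on:
kubectl get pods --namespace zfs-system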
kubectl apply -f zfs-setup-setup.yaml
kubectl apply -f storageclass.yaml
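For reference, a LocalPV-ZFS StorageClass for PostgreSQL looks roughly like this. This is a sketch, not necessarily the exact content of storageclass.yaml: the class name and pool name are assumptions, the pool name must match the zpool created by the setup manifest, and recordsize=8k is a common choice because it matches PostgreSQL's 8 kB pages:
kubectl apply -f- <<EOF
apiVersion: storage.k8s.io/v1
kind: StorageClass
metadata:
  name: zfs
provisioner: zfs.csi.openebs.io
parameters:
  poolname: "zfspv-pool"   # must match the zpool created on the nodes
  fstype: "zfs"
  recordsize: "8k"         # aligns with PostgreSQL's 8 kB page size
  compression: "on"
volumeBindingMode: WaitForFirstConsumer
EOF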
If you want observability, install kube-prometheus-stack:
helm upgrade --install \
--repo https://prometheus-community.github.io/helm-charts \
--namespace prom-system --create-namespace \
--set prometheus.prometheusSpec.podMonitorSelectorNilUsesHelmValues=false \
--set prometheus.prometheusSpec.serviceMonitorSelectorNilUsesHelmValues=false \
prom kube-prometheus-stack
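Once the stack is up, you can reach the bundled Grafana with a port-forward (with the release name prom, the Service should be called prom-grafana, though the name can vary with chart version; the chart's default login is admin / prom-operator):
kubectl port-forward --namespace prom-system svc/prom-grafana 3000:80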
helm upgrade --install --namespace cnpg-system --create-namespace \
--repo https://cloudnative-pg.github.io/charts \
cloudnative-pg cloudnative-pg
Fun options to add:
--set monitoring.podMonitorEnabled=true
--set monitoring.grafanaDashboard.create=true
--set monitoring.grafanaDashboard.namespace=prom-system
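Optional: the local-path provisioner below gives you a plain (non-ZFS) local StorageClass, which can serve as a point of comparison when running the benchmarks further down.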
kubectl apply -f https://raw.githubusercontent.com/rancher/local-path-provisioner/v0.0.30/deploy/local-path-storage.yaml
BUCKET_NAME=cnpg-backups-$RANDOM
lin obj mb $BUCKET_NAME --cluster $REGION-1
lin object-storage keys-create --label $BUCKET_NAME \
--bucket_access.bucket_name=$BUCKET_NAME \
--bucket_access.permissions=read_write \
--bucket_access.region=$REGION \
--json > key.$BUCKET_NAME.json
cat > env.$BUCKET_NAME <<EOF
export AWS_ACCESS_KEY_ID=$(jq < key.$BUCKET_NAME.json .[0].access_key)
export AWS_SECRET_ACCESS_KEY=$(jq < key.$BUCKET_NAME.json .[0].secret_key)
export AWS_ENDPOINT_URL=https://$(jq < key.$BUCKET_NAME.json .[0].regions[0].s3_endpoint)/
export AWS_DEFAULT_REGION=$(jq < key.$BUCKET_NAME.json .[0].regions[0].id)
export BUCKET_NAME=$(jq < key.$BUCKET_NAME.json .[0].bucket_access[0].bucket_name)
EOF
Note: the variables are named AWS_* even when we're not using AWS. These are the default variable names used by the aws CLI and a few other tools; using them means we can use those tools without having to write a configuration profile or set additional command-line flags or environment variables.
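You can check that the credentials and endpoint work by loading the environment file and listing the (still empty) bucket:
. ./env.$BUCKET_NAME
aws s3 ls s3://$BUCKET_NAME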
First, a very simple database with just a primary and a replica:
kubectl apply -f cluster-minimal.yaml
watch kubectl get clusters,pods
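For the curious, cluster-minimal.yaml boils down to something like this (a sketch; the actual manifest may differ, the storageClass name is an assumption, and you shouldn't apply this on top of the file you just applied):
kubectl apply -f- <<EOF
apiVersion: postgresql.cnpg.io/v1
kind: Cluster
metadata:
  name: minimal
spec:
  instances: 2             # one primary + one replica
  storage:
    size: 10Gi
    storageClass: zfs      # assumption: the class created by storageclass.yaml
EOF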
Then, a more complex one, with backups, separate WAL storage, etc.:
. ./env.$BUCKET_NAME
CLUSTER_NAME=prod envsubst < cluster-maximal.yaml | kubectl apply -f-
watch kubectl get clusters,pods
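Once the cluster is up, and assuming its backup section is wired to the bucket (which is what the envsubst above is for), you can take a first base backup by creating a Backup resource pointing at the prod cluster:
kubectl apply -f- <<EOF
apiVersion: postgresql.cnpg.io/v1
kind: Backup
metadata:
  name: prod-initial-backup
spec:
  cluster:
    name: prod
EOF
# Then watch it complete:
kubectl get backups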
./benchmark.sh storageclass <storageClassName> [scale] [clients] [size]
Where:
- `scale` is the `-s` (scaling) parameter for pgbench (default value: 1)
- `clients` is the `--clients` parameter for pgbench (default value: 1)
- `size` is the size of the PV created for the database (default value: 10G)
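For example, assuming the ZFS StorageClass is named zfs, this runs pgbench with scale 100 and 8 clients against a 20G volume:
./benchmark.sh storageclass zfs 100 8 20G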
This will:
- create a cluster
- wait for the cluster to be up and running
- start a `pgbench` on the primary
- after the `pgbench`, show the size and apparent size of the database
- delete the cluster
./benchmark.sh cluster <clusterName> [scale] [clients]
This will only run the benchmark on the given cluster. It won't create (or destroy) that cluster.
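For instance, to benchmark the prod cluster created earlier:
./benchmark.sh cluster prod 100 8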
# Make sure that the environment file is loaded first
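. ./env.$BUCKET_NAME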
# Check the content of the bucket
aws s3 ls s3://$BUCKET_NAME
# Destroy everything in the bucket
aws s3 rm --recursive s3://$BUCKET_NAME
# Destroy the bucket itself (this is specific to Linode; other providers will use other commands)
lin obj rb $BUCKET_NAME --cluster $REGION-1
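Finally, to tear down the LKE cluster itself and clean up local files (this assumes CLUSTER_ID is still set from earlier; double-check the exact action name with lin lke --help):
lin lke cluster-delete $CLUSTER_ID
rm kubeconfig.$CLUSTER_ID key.$BUCKET_NAME.json env.$BUCKET_NAME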