
blockstore: Adjust rocksdb threadpool config #6166


Merged
steviez merged 1 commit from bstore_fix_bg_jobs into anza-xyz:master on May 9, 2025

Conversation

@steviez commented May 8, 2025

Problem

A previous change, #4214, wired up the ability to configure rocksdb threadpool sizes through BlockstoreOptions. That change kept the threadpool sizes the same, but unintentionally limited how many concurrent flushes and compactions could be scheduled in each pool. The result was reduced parallelism with mostly idle threadpools.
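As a minimal sketch of the failure mode (assuming the rust-rocksdb crate API; this is illustrative, not the actual agave code): the Env threadpool sizes and the Options job cap are independent knobs, so sizing the pools without also raising the cap leaves threads idle.

use rocksdb::{Env, Options};

// Hypothetical misconfiguration, not the agave code.
fn misconfigured_options(num_threads: i32) -> Options {
    let mut env = Env::new().expect("failed to create rocksdb Env");
    // Size the low-priority (compaction) and high-priority (flush) pools.
    env.set_background_threads(num_threads);
    env.set_high_priority_background_threads(num_threads / 4);

    let mut opts = Options::default();
    opts.set_env(&env);
    // But only one background job may be scheduled at a time, so the
    // generously sized pools above sit mostly idle.
    opts.set_max_background_jobs(1);
    opts
}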

Summary of Changes

This change restores the previous level of allowed parallelism in the compact and flush threadpools.

Who is Affected

With current MNB load, I believe a node had to have all of the following in order to experience a noticeable issue:

  1. Running with --enable-rpc-transaction-history
  2. Running with larger than default --limit-ledger-size
  3. Potentially getting actual RPC load on Blockstore methods; these increase reads which compete with compaction

Item 1 introduces two extra columns that increase our total amount of data written to the Blockstore by 2-3x. Item 2 leads to elevated I/O to clean up those two extra columns (see #6017 for more details).

Testing

With the current tip of master, I put a breakpoint here and observed the following (noting that the pool sizes scale with the machine's thread count N):

// v2.1
(gdb) print res
$1 = {max_flushes = 8, max_compactions = 24}

// tip of master & v2.2
(gdb) print res
$1 = {max_flushes = 1, max_compactions = 1}

// this PR
(gdb) print res
$1 = {max_flushes = 8, max_compactions = 32}

You may notice that max_compactions changes in this PR vs. v2.1. In RocksDB, the size of the threadpool and the number of jobs that can be scheduled in parallel are separate numbers (see the sketch after this list):

  • In <= v2.1 on this box, we'd create 32 threads for compaction but only allow up to 24 jobs to be run in parallel
  • This PR allows all 32 created threads to be utilized at once
    • I'd argue this makes the configuration more intuitive, and we can reduce the threadpool size subsequently
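As a concrete sketch of that relationship, using the pool sizes from the gdb output above (assumed rust-rocksdb calls for illustration; agave wires these values through BlockstoreOptions rather than setting them directly like this):

use rocksdb::{Env, Options};

// Hedged sketch: size the pools and the job cap together so every created
// thread can actually be scheduled. Note that RocksDB divides the
// max_background_jobs budget between flushes and compactions internally.
fn aligned_options() -> Options {
    let mut env = Env::new().expect("failed to create rocksdb Env");
    env.set_background_threads(32);              // compaction (low-pri) pool
    env.set_high_priority_background_threads(8); // flush (high-pri) pool

    let mut opts = Options::default();
    opts.set_env(&env);
    // Raise the overall cap to the combined pool size (8 + 32 threads).
    opts.set_max_background_jobs(40);
    opts
}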

@steviez (Author) commented May 8, 2025

I'm also hoping to get some runtime with this patch from some people who have seen issues with v2.2.

Assuming I have correctly identified the issue, we will need this for v2.2 as well. I see two options:

  • Merge this PR and BP (backport) it to v2.2
  • Revert [validator: Add CLI args to control rocksdb threadpool sizes #4214](https://github.com/anza-xyz/agave/pull/4214) and BP the revert to v2.2

@t-nelson commented May 8, 2025

> I'm also hoping to get some runtime with this patch from some people who have seen issues with v2.2.
>
> Assuming I have correctly identified the issue, we will need this for v2.2 as well. I see two options:
>
> • Merge this PR and BP it to v2.2
> • Revert [validator: Add CLI args to control rocksdb threadpool sizes #4214](https://github.com/anza-xyz/agave/pull/4214) and BP the revert to v2.2

are there any non-obvious concerns related to bping this pr? seems to be (mostly) reverting us to 2.1 behavior

@steviez (Author) commented May 8, 2025

> are there any non-obvious concerns related to bping this pr?

No, not really. I'm technically using a different function to modify the config, but I'm confident it is doing the same thing it was doing before, verified via the gdb inspection plus calling ps -p PID -T to examine the created threads.

> seems to be (mostly) reverting us to 2.1 behavior

Yeah, more or less. We also leave the CLI args hooked up, which aren't in v2.1.
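For reference, here is a hypothetical, Linux-only way to do the same thread-count check programmatically (illustrative only, not part of the PR): counting the entries in /proc/self/task, which is roughly the set of threads that ps -p PID -T enumerates.

use std::fs;

// Hypothetical helper, not in the PR: count this process's OS threads on
// Linux by listing /proc/self/task (one entry per thread).
fn os_thread_count() -> std::io::Result<usize> {
    Ok(fs::read_dir("/proc/self/task")?.count())
}

fn main() -> std::io::Result<()> {
    println!("threads: {}", os_thread_count()?);
    Ok(())
}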

@codecov-commenter commented
Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 82.8%. Comparing base (a1f90ac) to head (87fe4ae).
Report is 3 commits behind head on master.

Additional details and impacted files
@@            Coverage Diff            @@
##           master    #6166     +/-   ##
=========================================
- Coverage    82.8%    82.8%   -0.1%     
=========================================
  Files         842      842             
  Lines      377696   377690      -6     
=========================================
- Hits       313061   313048     -13     
- Misses      64635    64642      +7     

@t-nelson added the v2.2 (Backport to v2.2 branch) label May 9, 2025
mergify bot commented May 9, 2025

Backports to the beta branch are to be avoided unless absolutely necessary for fixing bugs, security issues, and perf regressions. Changes intended for backport should be structured such that a minimum effective diff can be committed separately from any refactoring, plumbing, cleanup, etc that are not strictly necessary to achieve the goal. Any of the latter should go only into master and ride the normal stabilization schedule. Exceptions include CI/metrics changes, CLI improvements and documentation updates on a case by case basis.

@steviez merged commit 93df405 into anza-xyz:master May 9, 2025
48 checks passed
@steviez deleted the bstore_fix_bg_jobs branch May 9, 2025 15:29
mergify bot pushed a commit that referenced this pull request May 9, 2025
blockstore: Adjust rocksdb threadpool config (#6166)

(cherry picked from commit 93df405)
steviez added a commit that referenced this pull request May 9, 2025
blockstore: Adjust rocksdb threadpool config (#6166) (#6179)


(cherry picked from commit 93df405)

Co-authored-by: steviez <steven@anza.xyz>
nibty pushed a commit to x1-labs/tachyon that referenced this pull request Jun 11, 2025
blockstore: Adjust rocksdb threadpool config (anza-xyz#6166) (anza-xyz#6179)


(cherry picked from commit 93df405)

Co-authored-by: steviez <steven@anza.xyz>
Labels
v2.2 Backport to v2.2 branch