mempool: Add push-pull gossip protocol (CAT) #1472

hvanz · 2023-10-11T10:14:55Z

Closes: #2027.

Relates to #1058.

The CAT (for Content-Addressable Transaction) pool is a gossip protocol for the mempool originally implemented by Celestia. CAT is a push-pull protocol in contrast to CometBFT's default push protocol.

The code in this PR was ported from Celestia's feature/cat branch. The original protocol is built on top of the priority mempool implementation (aka v1), which existed in CometBFT until v0.37. The current code was ported on top of CometBFT's default mempool implementation (CListMempool), so we had to make to some changes to adapt it to the different underlying implementation.

PR checklist

Tests written/updated
Changelog entry added in .changelog (we use unclog to manage our changelog)
Updated relevant documentation (docs/ or spec/) and code comments

hvanz · 2023-10-16T08:41:52Z

These are the results of some preliminary experiments run on a laptop using the e2e framework. For more thorough results we would still need to run these experiments with around 200 nodes in the cloud, as we do for the QA tests, probably also with a different network topology.

Here, each experiment instance has:

8 nodes, in a complete graph topology, that is, each node is connected to all the others. Note that this topology is where a CAT mempool should perform better and where the default mempool would perform worst.
The first node receives all the transaction load.
Each run lasts 4 minutes.

Instances are defined by the permutation of transaction load rate (r) in tx/s and transaction size (s) in bytes, with the following values:

r = [100, 200, 400, 800]
s = [256, 512, 1024, 2048]

For each (r,s) value, we run two consecutive instances: one with the CAT mempool and then one with the default mempool, which we call Flood, as a baseline for comparison.

We can see in the above graphs that:

Bandwidth consumption (bytes sent and received) of CAT is always lower than Flood, with Flood being 3 to 6 times larger.
The chain height, chain size, and block size are, in each instance, almost the same value for CAT and Flood, meaning that CAT does not miss transactions when compared to Flood, or that all transactions sent to the nodes are eventually included in the chain. For big transactions, the chain size of Flood is lower than CAT. This is expected as CAT is supposed to work better with large transactions.
Remember that the saturation point defined in the last QA experiments is r=400, s=1024, meaning that the performance of a node is degraded for that value or bigger. This can be seen in the mempool size metrics, where for higher values the nodes become unstable.
The metric 'Already received txs' counts the number of times a received transaction is already in the mempool cache. In all CAT instances, this metric is zero: the node receiving transactions from the client and pushing to the other nodes does not receive the transactions back from them. And the other nodes receive less duplicated transactions than with Flood.

hvanz · 2023-10-16T15:55:39Z

These are the same experiments as above but with the following topology, which is more realistic, where node 1 receives all the load:

These graphs look similar to the graphs of the above experiment. The main difference is that now the bandwidth of Flood is 2 times higher than that of CAT, instead of 3 to 6 times higher. This was expected, as now the push-pull protocol needs to transmit the transaction in various steps from node 1 to node 8.

github-actions · 2023-10-28T00:12:03Z

This pull request has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

mempool/sync_reactor.go

config/config.go

config/toml.go

faddat · 2024-01-05T10:27:36Z

but I must kindly ask that we backport this.

It is badly needed.

adizere · 2024-01-05T13:03:50Z

but I must kindly ask that we backport this.

It is badly needed.

Agree. Thanks Jacob!

Is there a specific app-chain team that is waiting on it? If so, my bad, I was not aware!

Even more importantly, are Celestia mainnet nodes employing the CAT mempool? Our last chat with their team (Oct/Nov) we agreed we'd push CAT over the finish line if they give us the green light they will use it in their mainnet, and the Comet+Celestia teams will do complementary testing of CAT. But I have not re-checked since then so my info is very stale. We should get an update. cc @cmwaters

adizere · 2024-01-08T17:15:22Z

@hvanz should we deprecate this PR in favor of #1971 ?

hvanz · 2024-01-09T10:01:57Z

@hvanz should we deprecate this PR in favor of #1971 ?

I think there's no need to. This PR is now up-to-date with main (and it contains all experiment results).

adizere · 2024-01-09T14:24:54Z

Then deprecate 1971 in favor of present PR? Not sure 1971 still has anything worth keeping, let me know if so. cc @faddat

BTW thanks for bringing the present PR up-to-date with main Hernán !

faddat · 2024-01-09T14:32:03Z

Big thanks from me too.

😍

I'll gladly close the other.

adizere · 2024-06-19T09:14:10Z

For the moment deprioritized in favor of #3297

hvanz added 3 commits October 5, 2023 11:00

Add CAT mempool reactor

094c671

Add missing changes to rpc/core/mempool.go

a13d6de

e2e: Add MempoolReactor to manifest

2c39819

hvanz added the mempool label Oct 11, 2023

hvanz added this to the 2023-Q4 milestone Oct 11, 2023

hvanz added 7 commits October 11, 2023 15:33

Merge branch 'main' into experimental/cat

4c60b82

+ update cat reactor following merge

e310c6d

Fix lint

01b3e2e

Revert to fix lint

034a980

More fixes

0e68188

make proto-gen

6d09bbb

Add SyncReactor interface

f1a0594

This was referenced Oct 13, 2023

cat #1429

Closed

pull celestia/cmwaters CAT mempool implementation into cometbft #1426

Closed

hvanz added 2 commits October 13, 2023 11:54

Refactor MempoolTx into CListEntry to fix lint

47a232d

Add IsEmpty method to CListEntry

1d0622d

hvanz self-assigned this Oct 13, 2023

Increase defaultGossipDelay and remove jitter

a53fe88

e2e: Add pex_reactor and log_level to manifest

0e926aa

github-actions bot added the stale For use by stalebot label Oct 28, 2023

hvanz added wip Work in progress and removed stale For use by stalebot labels Oct 31, 2023

thanethomson reviewed Nov 3, 2023

View reviewed changes

mempool/sync_reactor.go Outdated Show resolved Hide resolved

mempool/sync_reactor.go Outdated Show resolved Hide resolved

config/config.go Outdated Show resolved Hide resolved

thanethomson reviewed Nov 3, 2023

View reviewed changes

config/toml.go Outdated Show resolved Hide resolved

hvanz added 3 commits November 8, 2023 01:37

Merge branch 'main' into experimental/cat

a3c3495

Rename (mempool_)reactor to gossip_protocol

3c3c48d

Merge branch 'main' into experimental/cat

1060ed6

adizere modified the milestones: 2023-Q4, 2024-Q2 Jan 8, 2024

hvanz added 13 commits January 8, 2024 21:53

Merge branch 'main' into experimental/cat

251e8a9

Merge branch 'main' into experimental/cat

4d0a1d1

Fix lints

8d7941b

Reuse node.WaitSyncP2PReactor instead of mempool.SyncReactor

cebcb9c

fix spelling

24cd785

make mockery

27c85b6

Add required comments to proto messages

c37ff73

Fix lint

03fc5ed

Merge branch 'main' into experimental/cat

8124ad0

make proto-gen

4040010

Fix lint: import order

2ba6d7a

Merge branch 'main' into experimental/cat

af2 67E6 2351

No need to SetLogger again

927c0a0

Assign the correct base reactor

3dc7b60

adizere mentioned this pull request Jan 12, 2024

Support for the CAT (push-pull gossip) mempool #2027

Closed

adizere linked an issue Jan 12, 2024 that may be closed by this pull request

Support for the CAT (push-pull gossip) mempool #2027

Closed

adizere removed this from the 2024-Q2 milestone Jan 12, 2024

adizere closed this Jun 19, 2024

zrbecker deleted the experimental/cat branch February 7, 2025 17:04

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

mempool: Add push-pull gossip protocol (CAT) #1472

mempool: Add push-pull gossip protocol (CAT) #1472

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

mempool: Add push-pull gossip protocol (CAT) #1472

mempool: Add push-pull gossip protocol (CAT) #1472

Uh oh!

Conversation

Uh oh!

PR checklist

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!