Open
Description
Was: tendermint/tendermint#9076
It must be easy to profile/understand “what is my node currently doing?”, and "where my bandwidth is going?"
- Reintroduce Filtered Logging tendermint/tendermint#8752
- This is "trivially" done as the filtering was removed in v0.35.x
- metrics: remove peer id label from p2p metrics to prevent cardinality explosion. tendermint/tendermint#2600. Solution not clear (see discussion)
- Agree on the best tradeoff [Estimation 2 days]
- Implementation [Estimation 5 days]
- Ask node operators/validators what information and/or tools are currently missing in terms of logging/monitoring.
- Rework this item once we have their answers (list of issues)
- logs: Move output after CTRL + C from stdout to file #69
- This was a request from our community call feedback
- Audit logging across all packages and determine whether the logs are useful/necessary and whether they're at the correct level (info, debug, error).
- Starting point: Reduce P2P logging spam [130 logs, estimation 5 days]
- Other packages [Estimations proportional to the number of logs]
- Review logs in the
consensus
package #907 - rpc, 90 logs
- cmd, 60 logs
- light, 51 logs
- libs, 49 logs
- node, 47 logs
- statesync, 46 logs
- tools, 36 logs
- logging improvements: blocksync module #53
- abci, 33 logs
- evidence, 32 logs
- mempool, 29 logs
- state, 23 logs
- other (privval, types,proxy), 33 logs
- Review logs in the
- Improve Metrics Collection
- How transactions spread using Mempool and Block parts?
- Log hash of transactions received at mempool and as block parts to determine the duplication of transaction data. Will be useful in determining gains of in solving Compact Block Propagation tendermint/tendermint#7932
- Partially addressed by mempool: Add metric to measure how many times a tx was received #634 New metrics to track duplicate votes and block parts (#896) -- Forward port #905
- p2p: fix logspam tendermint/tendermint#9756
- node: wire up logger to mempool reactor tendermint/tendermint#9885
- http_json_handler: Don't have err failed to write response log entire error #119
- RPC server: Long Log Messages #654