-
Notifications
You must be signed in to change notification settings - Fork 53
Added support for compaction metrics in druid. #569
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@abdasgupta thanks for the PR. Please address my review comments and also add documentation about the metrics being exposed by druid, similar to https://github.com/gardener/etcd-druid/blob/master/docs/multinode-metrics.md, under docs/monitoring/compaction-metrics.md
.
Please also edit the release note to
Druid now exposes metrics related to snapshot compaction, on default port 8080. Please expose the desired metrics port via the etcd-druid service to allow metrics to be scraped by a Prometheus instance.
/hold until druid v0.18 is released |
/unhold |
How to categorize this PR?
/area disaster-recovery
/kind technical-debt
What this PR does / why we need it:
This PR exposes metrics related to compaction job initiated by druid.
Which issue(s) this PR fixes:
Fixes #515
Special notes for your reviewer:
As per the issue description, this PR exposes metrices for number of successful jobs, number of failed jobs, time taken to complete last job, number of delta events compacted, but it does not expose CPU and Memory consumption by last successful job. Resource consumption is the gauge that can only be measured from inside the job. Druid doesn't have access to that.
Release note: