Added support for compaction metrics in druid. #569

abdasgupta · 2023-04-05T08:28:35Z

How to categorize this PR?

/area disaster-recovery
/kind technical-debt

What this PR does / why we need it:
This PR exposes metrics related to the compaction jobs initiated by druid.

Which issue(s) this PR fixes:
Fixes #515

Special notes for your reviewer:
As per the issue description, this PR exposes metrics for the number of successful jobs, the number of failed jobs, the time taken to complete the last job, and the number of delta events compacted. It does not expose the CPU and memory consumption of the last successful job. Resource consumption is a gauge that can only be measured from inside the job, and druid doesn't have access to that.

Release note:

Druid now exposes metrics related to snapshot compaction, on default port 8080. Please expose the desired metrics port via the etcd-druid service to allow metrics to be scraped by a Prometheus instance.

shreyas-s-rao

@abdasgupta thanks for the PR. Please address my review comments and also add documentation about the metrics being exposed by druid, similar to https://github.com/gardener/etcd-druid/blob/master/docs/multinode-metrics.md, under docs/monitoring/compaction-metrics.md.

Please also edit the release note to:

Druid now exposes metrics related to snapshot compaction, on default port 8080. Please expose the desired metrics port via the etcd-druid service to allow metrics to be scraped by a Prometheus instance.

shreyas-s-rao · 2023-05-08T09:18:22Z

/hold until druid v0.18 is released

shreyas-s-rao · 2023-05-09T14:03:40Z

/unhold

abdasgupta requested a review from a team as a code owner April 5, 2023 08:28

abdasgupta force-pushed the metrics branch from de92ae3 to 1d419f1 Compare April 5, 2023 08:33

gardener-robot-ci-1 added the reviewed/ok-to-test Has approval for testing (check PR in detail before setting this label because PR is run on CI/CD) label Apr 5, 2023

gardener-robot-ci-3 removed the reviewed/ok-to-test Has approval for testing (check PR in detail before setting this label because PR is run on CI/CD) label Apr 5, 2023

abdasgupta force-pushed the metrics branch from 1d419f1 to 2fde2be Compare April 6, 2023 04:36

gardener-robot-ci-1 added reviewed/ok-to-test Has approval for testing (check PR in detail before setting this label because PR is run on CI/CD) and removed reviewed/ok-to-test Has approval for testing (check PR in detail before setting this label because PR is run on CI/CD) labels Apr 6, 2023

abdasgupta force-pushed the metrics branch from 2fde2be to c97e91c Compare April 6, 2023 05:20

gardener-robot-ci-1 added the reviewed/ok-to-test Has approval for testing (check PR in detail before setting this label because PR is run on CI/CD) label Apr 6, 2023

gardener-robot-ci-2 removed the reviewed/ok-to-test Has approval for testing (check PR in detail before setting this label because PR is run on CI/CD) label Apr 6, 2023

abdasgupta force-pushed the metrics branch from c97e91c to 02fcbcf Compare April 10, 2023 18:20

gardener-robot-ci-2 added the reviewed/ok-to-test Has approval for testing (check PR in detail before setting this label because PR is run on CI/CD) label Apr 10, 2023

gardener-robot-ci-1 removed the reviewed/ok-to-test Has approval for testing (check PR in detail before setting this label because PR is run on CI/CD) label Apr 10, 2023

abdasgupta force-pushed the metrics branch from 02fcbcf to 60ffbb8 Compare April 13, 2023 17:44

gardener-robot-ci-2 added the reviewed/ok-to-test Has approval for testing (check PR in detail before setting this label because PR is run on CI/CD) label Apr 13, 2023

gardener-robot-ci-1 removed the reviewed/ok-to-test Has approval for testing (check PR in detail before setting this label because PR is run on CI/CD) label Apr 13, 2023

shreyas-s-rao requested changes Apr 17, 2023

gardener-robot added the needs/changes Needs (more) changes label Apr 17, 2023

abdasgupta force-pushed the metrics branch from 60ffbb8 to a7ba8e4 Compare April 21, 2023 10:09

gardener-robot-ci-2 added the reviewed/ok-to-test Has approval for testing (check PR in detail before setting this label because PR is run on CI/CD) label Apr 21, 2023

gardener-robot-ci-3 removed the reviewed/ok-to-test Has approval for testing (check PR in detail before setting this label because PR is run on CI/CD) label Apr 21, 2023

aaronfern reviewed Apr 25, 2023

gardener-robot added needs/review Needs review and removed reviewed/lgtm Has approval for merging labels May 8, 2023

gardener-robot added the reviewed/do-not-merge Has no approval for merging as it may break things, be of poor quality or have (ext.) dependencies label May 8, 2023

gardener-robot removed the reviewed/do-not-merge Has no approval for merging as it may break things, be of poor quality or have (ext.) dependencies label May 9, 2023

ishan16696 requested changes May 19, 2023

abdasgupta added 4 commits May 22, 2023 12:42

Added support for compaction metrics in druid.

a78db9a

Addressed Shreyas comments.

c4c3aa9

Addressed Shreyas second review.

0142874

Addressed Shreys' third review.

791792d

abdasgupta force-pushed the metrics branch from 9b447ef to b7965f8 Compare May 22, 2023 07:22

gardener-robot added the needs/second-opinion Needs second review by someone else label May 22, 2023

gardener-robot-ci-2 removed the reviewed/ok-to-test Has approval for testing (check PR in detail before setting this label because PR is run on CI/CD) label May 22, 2023

Ishan's comments addressed.

29ff459

abdasgupta force-pushed the metrics branch from b7965f8 to 29ff459 Compare May 23, 2023 17:41

gardener-robot-ci-3 added reviewed/ok-to-test Has approval for testing (check PR in detail before setting this label because PR is run on CI/CD) and removed reviewed/ok-to-test Has approval for testing (check PR in detail before setting this label because PR is run on CI/CD) labels May 23, 2023

ishan16696 requested changes May 24, 2023

Addressed Ishan's second review.

6a91d9d

gardener-robot-ci-2 added the reviewed/ok-to-test Has approval for testing (check PR in detail before setting this label because PR is run on CI/CD) label May 24, 2023

gardener-robot-ci-1 removed the reviewed/ok-to-test Has approval for testing (check PR in detail before setting this label because PR is run on CI/CD) label May 24, 2023

ishan16696 approved these changes May 25, 2023

abdasgupta merged commit f5919fe into gardener:master May 26, 2023

gardener-robot added the status/closed Issue is closed (either delivered or triaged) label May 26, 2023

abdasgupta deleted the metrics branch May 26, 2023 09:35

This was referenced May 26, 2023

[Feature] Alerts for the compaction job metrics #603

Closed

Added service for metrics endpoint of etcd-druid gardener/gardener#8014

Merged

abdasgupta mentioned this pull request Jun 5, 2023

[Feature] ☂️ Monitor compaction jobs running on shoot control planes #610

Open

9 tasks

shreyas-s-rao assigned abdasgupta Jul 28, 2023
