Expose triggering on-demand full snapshot via HTTP endpoint #143

shreyas-s-rao · 2019-03-27T06:15:34Z

What this PR does / why we need it:
This PR adds functionality to trigger on-demand full snapshots via the HTTP endpoint /snapshot/full.
This PR also fixes a few unhandled scenarios for the case when snapstore provider is not configured, such as etcd readiness probe handling and defragmentation. It also refactors the server command to make it more modular.

Which issue(s) this PR fixes:
Fixes #113

Special notes for your reviewer:
Also refactored the TriggerFullSnapshot and DefragDataPeriodically methods, as well as the order of object creation in NewServerCommand in order to pass the Snapshotter to the HTTPHandler.
Will add unit tests shortly.

Release note:

Added functionality to trigger on-demand full snapshots via the HTTP endpoint `/snapshot/full`.

swapnilgm

Please address the comments. NewServerCommand code is not that simple. It involves complex coordination logic. So, please handle it carefully.

pkg/server/httpAPI.go

cmd/server.go

shreyas-s-rao · 2019-05-13T09:37:30Z

@swapnilgm I have made the suggested changes and also refactored code to wait for etcd probe to succeed before setting http status to OK, even in the case of etcd-events.

pkg/server/httpAPI.go

amshuman-kr

Might it simplify the code if we split the snapshotting and non-snapshotting cases as separate funcs? Channels and goroutines embedded in conditions in a long func makes me nervous :-)

shreyas-s-rao · 2019-05-14T09:01:20Z

@amshuman-kr I did consider this, but given it would create a lot of code duplication, I stuck to the current approach.

amshuman-kr · 2019-05-14T09:03:34Z

it would create a lot of code duplication

@shreyas-s-rao how about smaller funcs for the reusable code?

shreyas-s-rao · 2019-05-14T09:32:41Z

@shreyas-s-rao how about smaller funcs for the reusable code?

Makes sense @amshuman-kr . I'll make the necessary changes.

shreyas-s-rao · 2019-06-25T09:48:06Z

@amshuman-kr @swapnilgm I have addressed the suggestions. Also fixed a couple of things wrt no-snapstore-provider case, namely probe loop and periodic defrag (for etcd-events case).

amshuman-kr

LGTM apart from one comment about checking the snapshotterEnabled flag twice.

The comments about wait period between probing etcd can be discussed/addressed later.

amshuman-kr · 2019-06-26T05:51:02Z

cmd/server.go

+				insecureSkipVerify,
+				etcdEndpoints)
+
+			if snapshotterEnabled {


Can't this block be merged with this to avoid checking the same flag twice?

amshuman-kr · 2019-06-26T05:58:02Z

cmd/server.go

+		if err != nil {
+			logger.Errorf("Failed to probe etcd: %v", err)
+			handler.Status = http.StatusServiceUnavailable
+			continue


I know this comes from existing code. So, not necessarily in this PR, but shouldn't we think about a wait period before the next probe?

Yes Amshu. It would be better to quickly merge this PR and take this up in a separate PR.

The call timeout for etcd probe is set to default 30 sec i.e. each every probe call blocks for 30 sec if etcd is unavailable. I think its not necessary to wait in between as it dosen't introduce much network traffic as well.

amshuman-kr · 2019-06-26T06:26:00Z

cmd/server.go

+		if err != nil {
+			logger.Errorf("Failed to probe etcd: %v", err)
+			handler.Status = http.StatusServiceUnavailable
+			continue


Same as above.

shreyas-s-rao · 2019-06-26T08:46:05Z

@amshuman-kr @swapnilgm I've addressed the latest suggestions. PTAL.

amshuman-kr

LGTM

swapnilgm

Overall LGTM. Refactoring looks good as well. Just one correction mentioned in comment. Please address it.

swapnilgm · 2019-07-03T12:01:16Z

pkg/server/httpAPI.go

+				// This is needed to stop the currently running snapshotter.
+				atomic.StoreUint32(&h.AckState, HandlerAckWaiting)
+				h.Logger.Info("Changed handler state.")
+				h.ReqCh <- emptyStruct


Why do we have two requests to stop snapshotter? Why do we have previous one https://github.com/gardener/etcd-backup-restore/pull/143/files#diff-912f19dfd8da2a79f988fd5045cd7dd0R146

Addressed. Thanks for the catch!

Signed-off-by: Shreyas Rao <shreyas.sriganesh.rao@sap.com>

swapnilgm

LGTM. Now that we have snapshotter in as a part of handler. Probably we could get rid of reqCH and ackCh on handler. And simple make use of ssrStateMutex. But we will have that in separate PR.

shreyas-s-rao added this to the 0.7.0 milestone Mar 27, 2019

shreyas-s-rao requested review from georgekuruvillak and swapnilgm as code owners March 27, 2019 06:15

swapnilgm suggested changes Mar 29, 2019

View reviewed changes

pkg/server/httpAPI.go Outdated Show resolved Hide resolved

pkg/server/httpAPI.go Outdated Show resolved Hide resolved

cmd/server.go Outdated Show resolved Hide resolved

shreyas-s-rao force-pushed the full-snapshot-on-demand branch 2 times, most recently from 4a65144 to 5075f72 Compare May 13, 2019 09:34

ashwani2k reviewed May 14, 2019

View reviewed changes

pkg/server/httpAPI.go Outdated Show resolved Hide resolved

shreyas-s-rao force-pushed the full-snapshot-on-demand branch from 5075f72 to b540b30 Compare May 14, 2019 08:47

amshuman-kr reviewed May 14, 2019

View reviewed changes

shreyas-s-rao force-pushed the full-snapshot-on-demand branch from b540b30 to c409c22 Compare June 25, 2019 09:45

amshuman-kr suggested changes Jun 26, 2019

View reviewed changes

shreyas-s-rao force-pushed the full-snapshot-on-demand branch from c409c22 to c965a25 Compare June 26, 2019 08:43

amshuman-kr approved these changes Jun 26, 2019

View reviewed changes

shreyas-s-rao force-pushed the full-snapshot-on-demand branch from c965a25 to 4469567 Compare June 26, 2019 09:36

swapnilgm suggested changes Jul 4, 2019

View reviewed changes

shreyas-s-rao force-pushed the full-snapshot-on-demand branch from 4469567 to 8fe06ed Compare July 4, 2019 06:35

shreyas-s-rao added 3 commits July 4, 2019 12:06

Expose triggering on-demand full snapshot via HTTP endpoint

c5de26f

Signed-off-by: Shreyas Rao <shreyas.sriganesh.rao@sap.com>

Fix http health status for no-storage-provider case

5a86fcd

Signed-off-by: Shreyas Rao <shreyas.sriganesh.rao@sap.com>

Add periodic defragmentation for no-storage-provider case

8ef9e6f

Signed-off-by: Shreyas Rao <shreyas.sriganesh.rao@sap.com>

shreyas-s-rao force-pushed the full-snapshot-on-demand branch from 8fe06ed to 8ef9e6f Compare July 4, 2019 06:37

swapnilgm added reviewed/ok-to-test Has approval for testing (check PR in detail before setting this label because PR is run on CI/CD) and removed needs/ok-to-test Needs approval for testing (check PR in detail before setting this label because PR is run on CI/CD) labels Jul 4, 2019

gardener-robot-ci-1 added needs/ok-to-test Needs approval for testing (check PR in detail before setting this label because PR is run on CI/CD) and removed reviewed/ok-to-test Has approval for testing (check PR in detail before setting this label because PR is run on CI/CD) labels Jul 4, 2019

swapnilgm approved these changes Jul 4, 2019

View reviewed changes

swapnilgm merged commit 8c8d781 into gardener:master Jul 4, 2019

shreyas-s-rao deleted the full-snapshot-on-demand branch July 5, 2019 02:58

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Expose triggering on-demand full snapshot via HTTP endpoint #143

Expose triggering on-demand full snapshot via HTTP endpoint #143

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Expose triggering on-demand full snapshot via HTTP endpoint #143

Expose triggering on-demand full snapshot via HTTP endpoint #143

Uh oh!

Conversation

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!