fix: adding imagePullSecret to update job #39032
Conversation
Thank you for looking into this issue and creating both the PR and the list of TODOs.
With the imagePullSecret fixed, and once we make the name of the workdir volume a bit more specific to avoid clashes, I'd consider this PR ready to be merged.
Optional: copying over the init containers could be part of this PR if we think it is low-hanging fruit, or a follow-up. AFAIK the init containers run sequentially, and we could add ours at the end, which should be fine.
There was a new comment in the parent issue about the imagePullPolicy; I'm not sure whether it is really not picked up, and where it would be missing.
I don't really understand how a start-dev would end up in here. If it is via a podTemplate, I'd say it is not supported. If there is no simple way to handle it beyond also replacing start-dev or adding a profile, I'd say failing and then recreating is fine. I would assume that it is failing today already, as the init container wouldn't create the JSON, and the second one then wouldn't be able to read it.
builder.withInitContainers();
builder.withContainers();

// TODO: should the scheduling fields be tweaked
I'd say it would usually be safe to keep them as is. Some people want to run Keycloak only on specific nodes, and could for example also ensure that it runs only on ARM architecture nodes.
As an alternative, we might eventually add separate scheduling options to the upgrade part of the CR. Until then, I think we're good to keep it as is.
> Until then, I think we're good to keep it as is.
I'm fine with that; I just wanted to highlight a behavioral difference between what the logic was doing and what it will now do. Let's expand the comment to explain why keeping the scheduling is ok.
agreed, please expand the comment.
// mix in the existing state
var currentSecrets = builder.buildImagePullSecrets();
current.getSpec().getTemplate().getSpec().getImagePullSecrets().stream()
        .filter(s -> !currentSecrets.contains(s))
        .forEach(builder::addToImagePullSecrets);
// TODO: if the name is the same, but the volume has changed the behavior will be inconsistent
I'd say if we make the volume name a bit more specific we should be good for now.
See above for a suggestion.
This concern is about a difference between the desired and existing state. Let's say in the existing state you have a volume named config pointed at configmap X, and in the desired state you have a volume also named config pointed at configmap Y - we are not accounting for that possibility.
This could happen with more than just the unsupported podTemplate, for example via the cache config configmap.
If we want to account for this we'll need to do a name remapping as part of the merge.
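To illustrate, here is a rough sketch of what such a remapping could look like when mixing the existing volumes into the desired spec. This is not part of the PR; the helper name `mergeVolumesWithRemap` and the `-existing` suffix are made up for illustration, and it assumes the volume lists on both specs are initialized.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.Objects;
import java.util.stream.Stream;

import io.fabric8.kubernetes.api.model.PodSpec;
import io.fabric8.kubernetes.api.model.Volume;

// Hypothetical helper: merge the existing volumes into the desired spec, renaming any
// existing volume whose name clashes with a desired volume that has a different
// definition, and remapping the existing containers' volumeMounts accordingly.
static void mergeVolumesWithRemap(PodSpec desired, PodSpec existing) {
    Map<String, Volume> desiredByName = new HashMap<>();
    desired.getVolumes().forEach(v -> desiredByName.put(v.getName(), v));

    Map<String, String> renames = new HashMap<>();
    for (Volume existingVolume : existing.getVolumes()) {
        Volume desiredVolume = desiredByName.get(existingVolume.getName());
        if (desiredVolume == null) {
            // no clash, just copy it over
            desired.getVolumes().add(existingVolume);
        } else if (!Objects.equals(desiredVolume, existingVolume)) {
            // same name, different content (e.g. a changed cache config configmap)
            String newName = existingVolume.getName() + "-existing";
            renames.put(existingVolume.getName(), newName);
            existingVolume.setName(newName);
            desired.getVolumes().add(existingVolume);
        }
        // identical volume: nothing to do
    }

    // any container carried over from the existing state has to mount the renamed volumes
    Stream.concat(existing.getContainers().stream(), existing.getInitContainers().stream())
            .flatMap(c -> c.getVolumeMounts().stream())
            .filter(m -> renames.containsKey(m.getName()))
            .forEach(m -> m.setName(renames.get(m.getName())));
}
```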
Thank you, I now see your point. I'd say this is in the unsupported space, and within the margin of error when we accept it as-is now.
@ApplicationScoped
public class KeycloakUpdateJobDependentResource extends CRUDKubernetesDependentResource<Job, Keycloak> {

    // shared volume configuration
    private static final String WORK_DIR_VOLUME_NAME = "workdir"; // TODO: could conflict with an existing volume name
Let's try to make it unique. Alternatively we could make it non-static and add a random suffix.
Suggested change:
- private static final String WORK_DIR_VOLUME_NAME = "workdir"; // TODO: could conflict with an existing volume name
+ private static final String WORK_DIR_VOLUME_NAME = "update-job-temporary-workdir";
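If we instead went the non-static route, the random suffix could look roughly like this (purely illustrative; the field name and prefix are made up, and the suggested constant above is the simpler fix):

```java
import java.util.UUID;

// non-static alternative: a per-instance volume name with a short random suffix
private final String workDirVolumeName = "update-job-workdir-" + UUID.randomUUID().toString().substring(0, 8);
```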
downstream.accept(arg);
}).toList();
// TODO: reuse ConfigArgConfigSource parsing
// also start-dev is problematic - at the very least --profile=dev should be added
I wonder how a start-dev would end up here ... again via a podTemplate? I'd say this is not supported when using the Operator?
Yes, the podTemplate is the only way to use start-dev.
If we are ok with letting the job fail to start, then I can just minimize this comment and move on - this in general is a misconfiguration because you'd have to go to great lengths to cluster in dev mode.
I'm ok with letting it fail to start.
// remove things we don't want

// TODO: this may actually not be correct - via the unsupported PodTemplate someone could make configuration dependent upon an init container
builder.withInitContainers();
Copying over the init containers might be a good thing. But as the PodTemplate is not supported, we could do a best effort here.
Reading up on init containers, it seems that they run sequentially, so we could then add ours at the end, roughly as sketched below.
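A minimal sketch of that best effort, assuming we have the existing init containers and the update job's own init container at hand (the helper name `copyInitContainers` is made up):

```java
import java.util.ArrayList;
import java.util.List;

import io.fabric8.kubernetes.api.model.Container;
import io.fabric8.kubernetes.api.model.PodSpecBuilder;

// Hypothetical helper: keep the existing init containers (best effort, podTemplate is
// unsupported) and append the update job's init container last, so it still runs at the
// end of the sequential init phase.
static void copyInitContainers(PodSpecBuilder builder, List<Container> existingInitContainers,
        Container updateJobInitContainer) {
    List<Container> initContainers = new ArrayList<>(existingInitContainers);
    initContainers.add(updateJobInitContainer);
    builder.withInitContainers(initContainers);
}
```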
I don't believe the comment is correct. imagePullPolicy is inherited at the container level. I believe they are just seeing the same issue with imagePullSecrets.
Yes, hopefully we can just let it fail rather than adding more logic to gracefully handle an unsupported situation.
Thank you for the replies, I added some comments above. If we want to be on the safe side, we could prevent rolling updates when there is a podTemplate present; still, I think that would go too far. We could add a warning about podTemplates in general and especially changes to them, and that this might affect the correct working of the upgrade command. Are we ready to wrap up the low-hanging and obvious items (renaming the workdir, extending the comment) in this PR? I would also add the initContainers. Updating the docs to warn about the unsupported podTemplate could be done in a follow-up or in this one. I am not sure how to handle init containers.
As long as you are good with not accounting for volume changes, like a change in the cache config configmap, then I can wrap this up.
I'm good with that.
The comments and volume / mount names have been updated; this should be ready for review.
closes: keycloak#39015

Signed-off-by: Steve Hawkins <shawkins@redhat.com>
Follow-up issue for the docs: #39059
closes: #39015
@ahus1 @pruivo @vmuzikar I believe we have quite a few corner cases yet to address with the auto job logic, beyond just the image pull secrets. I tried to add TODOs where there are additional concerns.
At a high level:
- Usage of the unsupported pod template could cause things to fail or be inaccurate in several ways.
- Starting with the desired state pod spec copies over additional settings like the imagePullSecrets, but we still end up needing merge logic against the existing state, so it doesn't actually save us that much effort. We only have merge logic for the imagePullSecrets and volumes. We may have to review whether anything else is applicable - or what we may need to do to keep this logic in sync with what the main deployment logic is doing.
- Having the auto strategy work with start-dev is problematic because of the profile change - we can account for that (see the sketch after this list), or probably just fail / recreate if someone is using start-dev.
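For reference, accounting for it could look roughly like the following when normalizing the desired command for the update job. This is only a sketch; the helper name `normalizeCommand` is made up, and as discussed above we may simply let the job fail instead.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper: swap start-dev for the regular start command plus the dev
// profile flag, so the update job compares against an equivalent configuration.
static List<String> normalizeCommand(List<String> args) {
    List<String> normalized = new ArrayList<>(args.size());
    for (String arg : args) {
        if ("start-dev".equals(arg)) {
            normalized.add("start");
            normalized.add("--profile=dev");
        } else {
            normalized.add(arg);
        }
    }
    return normalized;
}
```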