MigrationModel duplicate entry by mabartos · Pull Request #39994 · keycloak/keycloak · GitHub

MigrationModel duplicate entry #39994

Merged: 4 commits merged into keycloak:main on Jun 11, 2025

Conversation

@mabartos (Contributor) commented on May 27, 2025

Closes: #39866

  • Cleanup of duplicated old records in the MIGRATION_MODEL table (see the SQL sketch below)
  • Unique constraint on the VERSION column in the MIGRATION_MODEL table
  • A more advanced locking mechanism will be handled in a follow-up issue, as this change might be backported to 26.2.x
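
For illustration, a rough PostgreSQL-flavoured sketch of what these two changes amount to; the real change ships as a Liquibase changelog, and the constraint name below is made up:

-- Sketch only, not the shipped changelog.
-- 1) Clean up older duplicates, keeping the most recently updated row per version
--    (ties on UPDATE_TIME are discussed further down in the review):
DELETE FROM MIGRATION_MODEL
WHERE UPDATE_TIME < (SELECT MAX(m2.UPDATE_TIME)
                     FROM MIGRATION_MODEL m2
                     WHERE m2.VERSION = MIGRATION_MODEL.VERSION);

-- 2) Prevent concurrent nodes from inserting the same version again
--    (constraint name is illustrative):
ALTER TABLE MIGRATION_MODEL
    ADD CONSTRAINT UK_MIGRATION_VERSION UNIQUE (VERSION);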

I've reproduced the issue before:
[Screenshot From 2025-05-27 14-05-52]

After these changes, everything should be good:
[Screenshot From 2025-05-27 15-43-50]

Testing approach

  • Included these changes in the distribution and built a custom image mabartos/keycloak:26.3.0
  • Used a Deployment config with 5 replicas (see the manifest below)
  • Started from Keycloak 26.1.3 as the initial version
  • Then upgraded to mabartos/keycloak:26.3.0
  • The change sets were propagated (cleanup + unique constraint)
  • The table contains only 2 records, one per version (26.1.3 and 26.3.0); see the verification query after this list
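
A quick way to confirm that end state directly on the test database (assuming psql access to the keycloak database; this query is not part of the PR itself):

-- Expect exactly one row per version, i.e. 26.1.3 and 26.3.0:
SELECT VERSION, COUNT(*) AS entries
FROM MIGRATION_MODEL
GROUP BY VERSION
ORDER BY VERSION;
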
Testing deployment used:
apiVersion: v1
kind: Service
metadata:
  name: keycloak
  labels:
    app: keycloak
spec:
  ports:
    - protocol: TCP
      port: 8080
      targetPort: http
      name: http
  selector:
    app: keycloak
  type: ClusterIP
---
apiVersion: v1
kind: Service
metadata:
  labels:
    app: keycloak
  # Headless service used by JGroups DNS discovery to form the Keycloak cluster
  name: keycloak-discovery
spec:
  selector:
    app: keycloak
  # Allow not-yet-ready Pods to be visible to ensure the forming of a cluster if Pods come up concurrently
  # publishNotReadyAddresses: true
  clusterIP: None
  type: ClusterIP
---
apiVersion: apps/v1
# Use a stateful setup to ensure that for a rolling update Pods are restarted with a rolling strategy one-by-one.
# This prevents losing in-memory information stored redundantly in two Pods.
#kind: StatefulSet
kind: Deployment
metadata:
  name: keycloak
  labels:
    app: keycloak
spec:
  #serviceName: keycloak-discovery
  # Run with one replica to save resources, or with two replicas to allow for rolling updates for configuration changes
  replicas: 5
  strategy:
    type: Recreate
  selector:
    matchLabels:
      app: keycloak
  template:
    metadata:
      labels:
        app: keycloak
    spec:
      containers:
        - name: keycloak
          image: quay.io/keycloak/keycloak:26.1.3
          args: ["start"]
          env:
            - name: KC_BOOTSTRAP_ADMIN_USERNAME
              value: "admin"
            - name: KC_BOOTSTRAP_ADMIN_PASSWORD
              value: "admin"
            # In a production environment, add a TLS certificate to Keycloak to either end-to-end encrypt the traffic between
            # the client and Keycloak, or to encrypt the traffic between your proxy and Keycloak.
            # Respect the proxy headers forwarded by the reverse proxy
            # In a production environment, verify which proxy type you are using, and restrict access to Keycloak
            # from other sources than your proxy if you continue to use proxy headers.
            - name: KC_PROXY_HEADERS
              value: "xforwarded"
            - name: KC_HTTP_ENABLED
              value: "true"
            # In this explorative setup, no strict hostname is set.
            # For production environments, set a hostname for a secure setup.
            - name: KC_HOSTNAME_STRICT
              value: "false"
            - name: KC_HEALTH_ENABLED
              value: "true"
            - name: 'KC_CACHE'
              value: 'ispn'
            # Use the Kubernetes configuration for distributed caches which is based on DNS
            - name: 'KC_CACHE_STACK'
              value: 'kubernetes'
            # Passing the Pod's IP primary address to the JGroups clustering as this is required in IPv6 only setups
            - name: POD_IP
              valueFrom:
                fieldRef:
                  fieldPath: status.podIP
            # Instruct JGroups which DNS hostname to use to discover other Keycloak nodes
            # Needs to be unique for each Keycloak cluster
            - name: JAVA_OPTS_APPEND
              value: '-Djgroups.dns.query="keycloak-discovery" -Djgroups.bind.address=$(POD_IP)'
            - name: 'KC_DB_URL_DATABASE'
              value: 'keycloak'
            - name: 'KC_DB_URL_HOST'
              value: 'postgres'
            - name: 'KC_DB'
              value: 'postgres'
            # In a production environment, use a secret to store username and password to the database
            - name: 'KC_DB_PASSWORD'
              value: 'keycloak'
            - name: 'KC_DB_USERNAME'
              value: 'keycloak'
          ports:
            - name: http
              containerPort: 8080
            - name: management
              containerPort: 9000  
          startupProbe:
            httpGet:
              path: /health/started
              port: 9000
            failureThreshold: 15
          readinessProbe:
            httpGet:
              path: /health/ready
              port: 9000
          livenessProbe:
            httpGet:
              path: /health/live
              port: 9000
          resources:
            limits:
              cpu: 2000m
              memory: 4000Mi
            requests:
              cpu: 1500m
              memory: 3500Mi
---
# This is deployment of PostgreSQL with an ephemeral storage for testing: Once the Pod stops, the data is lost.
# For a production setup, replace it with a database setup that persists your data.
apiVersion: apps/v1
kind: Deployment
metadata:
  name: postgres
  labels:
    app: postgres
spec:
  
  replicas: 1
  selector:
    matchLabels:
      app: postgres
  template:
    metadata:
      labels:
        app: postgres
    spec:
      containers:
        - name: postgres
          image: mirror.gcr.io/postgres:17
          env:
            - name: POSTGRES_USER
              value: "keycloak"
            - name: POSTGRES_PASSWORD
              value: "keycloak"
            - name: POSTGRES_DB
              value: "keycloak"
            - name: POSTGRES_LOG_STATEMENT
              value: "all"
          ports:
            - name: postgres
              containerPort: 5432
          volumeMounts:
            # Using volume mount for PostgreSQL's data folder as it is otherwise not writable
            - name: postgres-data
              mountPath: /var/lib/postgresql
      volumes:
        - name: postgres-data
          emptyDir: {}
---
apiVersion: v1
kind: Service
metadata:
  labels:
    app: postgres
  name: postgres
spec:
  selector:
    app: postgres
  ports:
    - protocol: TCP
      port: 5432
      targetPort: 5432
  type: ClusterIP

We could probably even update the MigrationModelAdapter to only update fields instead of trying to create a new record.

@mabartos mabartos requested a review from a team as a code owner May 27, 2025 13:49
@mabartos mabartos marked this pull request as draft May 27, 2025 13:51
@ahus1 (Contributor) commented on May 27, 2025

We could probably even update the MigrationModelAdapter to only update fields instead of trying to create a new record.

I disagree with this approach: we want to do an insert, because the insert together with the unique constraint prevents running a migration twice. Running a migration twice on a realm might lead to wrong results, as a developer would never have anticipated that situation.
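
In other words (a sketch of the race only, not the actual Keycloak code path; the ID, VERSION and UPDATE_TIME values below are invented):

-- Node A records the migration it has just performed:
INSERT INTO MIGRATION_MODEL (ID, VERSION, UPDATE_TIME)
VALUES ('node-a-random-id', '26.3.0', 1749000000000);

-- Node B races on the same upgrade; the unique constraint on VERSION rejects
-- the second insert, so the migration is not silently run a second time:
INSERT INTO MIGRATION_MODEL (ID, VERSION, UPDATE_TIME)
VALUES ('node-b-random-id', '26.3.0', 1749000000123);
-- ERROR: duplicate key value violates unique constraint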

@mabartos (Contributor, Author)

@ahus1 Thanks for the review! 🌴 I've updated the approach based on your comments and I'll test it and fix the test cases later.

@mabartos (Contributor, Author) commented on May 30, 2025

I tried this again with the testing approach described in the PR description, and everything works as expected.

[image]

Closes keycloak#39866

Signed-off-by: Martin Bartoš <mabartos@redhat.com>
Co-authored-by: Alexander Schwartz <alexander.schwartz@gmx.net>
Signed-off-by: Alexander Schwartz <aschwart@redhat.com>
@ahus1 (Contributor) commented on Jun 2, 2025

@mabartos - I reviewed this change locally and tested some variations:

  • When adding a DB schema, the delete failed as it duplicated the schema prefix; and if you pass in the schema, you must not add a dot as a suffix, as Liquibase adds it. The safest way is to pass null, and then Liquibase does the right thing.
  • When there are entries in the table with the same version and the same update_time, the select will not find them, and therefore not delete them. I updated the SQL to compare the IDs instead.
  • I see that you pass in a normalized ID column. If we do that, we should do the same for the other columns as well.

I've pushed a commit that fixed it for me when testing it locally. Please let me know if you agree with these changes.

Signed-off-by: Martin Bartoš <mabartos@redhat.com>
@mabartos (Contributor, Author) commented on Jun 10, 2025

@ahus1 Thanks for the review and the additional testing!

The safest way is to pass null, and then Liquibase does the right thing.

+1

When there are entries in the table with the same version and the same update_time, the select will not find them, and therefore not delete them. I updated the SQL to compare the IDs instead.

Yes, that is true, and your changes largely resolve this issue. However, since the IDs of the MigrationModel entity (the PKs) are not auto-incremented, but are randomly generated and referenced as resource tags, they cannot be used to compare creation order. Comparing IDs alone could delete the more recent record for a version even though it was added after the older one. I've slightly edited the SQL script to consider the update time first and, only if those are equal, fall back to picking one of the rows arbitrarily. WDYT? (A sketch of such a delete follows below.)
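
One possible shape of such a delete, assuming the tie on UPDATE_TIME is broken by comparing the randomly generated IDs (the actual changelog SQL may differ):

-- Delete every row for which a "better" row with the same VERSION exists:
-- either one with a later UPDATE_TIME, or one with the same UPDATE_TIME
-- and a greater ID (an arbitrary but deterministic tie-break).
DELETE FROM MIGRATION_MODEL m
WHERE EXISTS (
    SELECT 1
    FROM MIGRATION_MODEL keeper
    WHERE keeper.VERSION = m.VERSION
      AND (keeper.UPDATE_TIME > m.UPDATE_TIME
           OR (keeper.UPDATE_TIME = m.UPDATE_TIME AND keeper.ID > m.ID))
);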

I see that you pass in a normalized ID column. If we do that, we should do the same for the other columns as well.

+1

I tried the same testing approach and it worked as expected.

Signed-off-by: Alexander Schwartz <aschwart@redhat.com>
@mabartos (Contributor, Author)

@ahus1 Just a note that I've added another commit to prevent some confusion: 071c5db

Described in: #39994 (comment)

@ahus1 (Contributor) left a comment


The new SQL is a bit more complicated than I hoped for. I verified it, and also did some manual tests with multiple duplicate entries, and it worked as expected.

Thank you for this change - approving!

ahus1 merged commit ad92af3 into keycloak:main on Jun 11, 2025
76 checks passed
@mabartos (Contributor, Author)

The new SQL is a bit more complicated than I hoped for.

@ahus1 Yes, same here, but at least it is now semantically correct. I tried to simplify things for review as much as possible.

Thanks!

Successfully merging this pull request may close these issues.

MigrationModel duplicate entry on Recreate Upgrade in Cluster with 2+ nodes