Refactor code to fetch current LSN position from Postgres. by dimitri · Pull Request #608 · dimitri/pgcopydb


Merged: 1 commit merged into main on Jan 2, 2024

Conversation

@dimitri (Owner) commented on Jan 2, 2024:

No description provided.

@dimitri added this to the v0.15 milestone on Jan 2, 2024
@dimitri self-assigned this on Jan 2, 2024
@dimitri merged commit 4273b4e into main on Jan 2, 2024
@dimitri deleted the refactor/fetch-current-lsn branch on January 2, 2024 at 11:17
" write_lsn pg_lsn, flush_lsn pg_lsn, replay_lsn pg_lsn)"
" write_lsn pg_lsn, flush_lsn pg_lsn, replay_lsn pg_lsn)",

"create table lsn_tracking(source pg_lsn, target pg_lsn)"
Contributor:

Are you planning to move the file-based LSN tracking to SQLite?

(I don't see this table being used anywhere.)

@dimitri (Owner, Author) replied:

Oops, I missed the clean-up for that. The idea did cross my mind, yes, but I wanted to clean up some other aspects first (summary information and top-level timings in the SQLite database, so that pgcopydb list summary can represent multi-step activity and also be used concurrently with the main command, or after the fact).

@arajkumar (Contributor) commented:

I also see lots of warnings after moving the sentinel to SQLite. Should we change the log level to debug?

10:04:16.611 2223479 WARN   Skipping sentinel replay_lsn update: failed to find a durable LSN matching current flushLSN
10:04:16.792 2223479 WARN   Skipping sentinel replay_lsn update: failed to find a durable LSN matching current flushLSN
10:04:16.975 2223479 WARN   Skipping sentinel replay_lsn update: failed to find a durable LSN matching current flushLSN
10:04:17.160 2223479 WARN   Skipping sentinel replay_lsn update: failed to find a durable LSN matching current flushLSN
10:04:17.360 2223479 WARN   Skipping sentinel replay_lsn update: failed to find a durable LSN matching current flushLSN
10:04:17.536 2223479 WARN   Skipping sentinel replay_lsn update: failed to find a durable LSN matching current flushLSN
10:04:26.113 2223477 INFO   Reported write_lsn 3/4247FD60, flush_lsn 3/42297318, replay_lsn 3/41E6C410
10:04:33.200 2223479 WARN   Skipping sentinel replay_lsn update: failed to find a durable LSN matching current flushLSN
10:04:36.113 2223477 INFO   Reported write_lsn 3/4247FDC8, flush_lsn 3/4247FD60, replay_lsn 3/41E6C410
10:04:45.322 2223479 WARN   Skipping sentinel replay_lsn update: failed to find a durable LSN matching current flushLSN
10:04:47.115 2223477 INFO   Reported write_lsn 3/4247FDF8, flush_lsn 3/4247FDC8, replay_lsn 3/41E6C410
10:04:53.536 2223479 WARN   Skipping sentinel replay_lsn update: failed to find a durable LSN matching current flushLSN
10:04:57.114 2223477 INFO   Reported write_lsn 3/4247FE60, flush_lsn 3/4247FDF8, replay_lsn 3/41E6C410
10:05:05.565 2223479 WARN   Skipping sentinel replay_lsn update: failed to find a durable LSN matching current flushLSN
10:05:07.115 2223477 INFO   Reported write_lsn 3/4247FF60, flush_lsn 3/4247FE60, replay_lsn 3/41E6C410
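
For background on the INFO lines above: the three positions are the replication client's standby feedback, and on the source they are visible per WAL sender through the standard pg_stat_replication view (Postgres 10+ column names shown below). This is only context, not how pgcopydb computes its sentinel values.

    -- On the source server, while pgcopydb's logical replication client is
    -- connected, the positions it reports show up per WAL sender:
    SELECT application_name, write_lsn, flush_lsn, replay_lsn
      FROM pg_stat_replication;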

@dimitri (Owner, Author) commented on Jan 3, 2024:

> I also see lots of warnings after moving the sentinel to SQLite. Should we change the log level to debug?

Either that or find a way to avoid the warning by having the data we're looking for? ;-)

@arajkumar (Contributor) commented on Jan 4, 2024:

@dimitri What happens in my tests is that the source is being ingested with parallel connections, but the target is replicated with a single connection, which causes a huge lag between source and target.
Because of this, we can't find a matching target insert LSN for the flush_lsn from the source.

I'm wondering: why can't we just use the LSN from pg_replication_origin_progress as the durable LSN (replay_lsn)?
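
A hedged sketch of that suggestion: pg_replication_origin_progress() is a stock Postgres function which, when run on the target, returns the source LSN up to which a given replication origin has been replayed. The origin name used below is an assumption for illustration, not necessarily the one pgcopydb registers.

    -- Run against the target database; returns a pg_lsn expressed in terms of
    -- the source server's WAL. With the second argument set to true, the value
    -- only advances once the corresponding local transactions are flushed.
    -- NOTE: the origin name 'pgcopydb' is an assumption.
    SELECT pg_replication_origin_progress('pgcopydb', true);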

@arajkumar (Contributor) replied:

Ahh, I think we have to fetch the LSN from the target to compare against when finding the durable LSN, not from the source. Let me fix it.
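
For reference, these are the standard ways to read a server's current LSN position, which is what the PR title refers to. They are stock Postgres 10+ functions; which of them pgcopydb calls on which side is exactly what is being sorted out above, so treat this as background rather than the implementation.

    -- On a primary (for instance the target while pgcopydb applies changes):
    SELECT pg_current_wal_insert_lsn() AS insert_lsn,
           pg_current_wal_lsn()        AS write_lsn,
           pg_current_wal_flush_lsn()  AS flush_lsn;

    -- On a standby, the last replayed position instead:
    SELECT pg_last_wal_replay_lsn();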
