DBZ-8594 Prevent data loss when primary key update is last operation in transaction #224

twthorn · 2025-01-21T22:52:40Z

We found an edge case, described in DBZ-8594.

There is another possible way to implement this: modify VitessChangeRecordEmitter so that it sets the offset before emitting the final delete record here. This would work, but it seemed like a more custom solution, and we'd prefer to stay as standard as possible. Let me know if you think this is preferable. I think VitessOffsetContext would be called there to resetVgtid (and maybe in the create/delete/update functions as well). What I don't like is that we would no longer get bug fixes or changes from the class we inherit from, so we would have to maintain more custom, overridden logic.

The main downside of this approach can be seen in testCopyTableAndRestart: if we use a special gtid (for a snapshot, or to fast-forward to current) and no other operations happen afterwards (no new vgtids), then the restart vgtid is still the special one, so that operation would be repeated. Perhaps the guidance we can give users is to enable transaction metadata (so begin/commit events are always sent even if tables are unaffected), but even that is not a guarantee.
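To make the downside concrete, here is a minimal sketch of the restart bookkeeping. It is not the connector's code: `RestartOffsetSketch`, `onBegin`, `onCommit`, and the vgtid strings are hypothetical stand-ins, and the rotate/reset semantics are assumed from the description in this thread.

```java
/** Hypothetical, simplified model of the offset bookkeeping; not the connector's real class. */
class RestartOffsetSketch {
    private String currentVgtid;
    private String restartVgtid;

    RestartOffsetSketch(String initialVgtid) {
        this.currentVgtid = initialVgtid;
        this.restartVgtid = initialVgtid;
    }

    /** BEGIN: move to the new vgtid and keep the previous one as the restart point. */
    void onBegin(String vgtid) {
        if (!vgtid.equals(currentVgtid)) {
            restartVgtid = currentVgtid;
            currentVgtid = vgtid;
        }
    }

    /** COMMIT: the transaction is fully processed, so it becomes the restart point. */
    void onCommit(String vgtid) {
        currentVgtid = vgtid;
        restartVgtid = vgtid;
    }

    String restartVgtid() {
        return restartVgtid;
    }

    public static void main(String[] args) {
        // Start from a special vgtid (placeholder value for "fast-forward to current").
        RestartOffsetSketch offsets = new RestartOffsetSketch("current");
        System.out.println("no events yet:  " + offsets.restartVgtid());

        offsets.onBegin("vgtid-101");
        System.out.println("after BEGIN:    " + offsets.restartVgtid());

        offsets.onCommit("vgtid-101");
        System.out.println("after COMMIT:   " + offsets.restartVgtid());
    }
}
```

In this model the restart point stays at the special value until the first COMMIT is processed, so a restart before then would repeat the special operation.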

A couple of examples of other connectors I think may be at risk:
Postgres
SQL Server

twthorn · 2025-01-21T22:54:35Z

@jpechane Can you give this a look when you get the chance? Thanks!

HenryCaiHaiying

Is it possible to use a Kafka transaction to solve the problem?

If DELETE, Tombstone, and CREATE are three events that belong to the same Vitess transaction, should we publish the three messages to Kafka in the same Kafka transaction? The downstream consumer won't see the messages until the whole transaction commits.

twthorn · 2025-01-22T17:28:32Z

@HenryCaiHaiying There are a few reasons to avoid transactions:

Tight coupling - Debezium embedded engine mode allows Debezium to be run in a standard Java application without Kafka/Kafka Connect. We do not want to make any assumptions about what other external systems will be associated with Debezium. We simply want each record to contain correct content (i.e. the offset refers to the last fully processed transaction)
Performance - enabling transactions comes with some throughput hit
Rigidity - all deployments must have transactions enabled to operate correctly

HenryCaiHaiying · 2025-01-23T05:45:19Z

src/main/java/io/debezium/connector/vitess/VitessStreamingChangeEventSource.java

            if (message.isTransactionalMessage()) {
-                // Tx BEGIN/END event
-                offsetContext.rotateVgtid(newVgtid, message.getCommitTime());


In the old code, rotateVgtid() is called in all cases of isTransactionalMessage() (including BEGIN, COMMIT, and other unknown types). Do we need to cover those unknown types in the new code?

As for unknown types: these are the only two places where we create the instance, with either begin or commit, so no other types are expected.

It might be safer to add an else {assert here} in case our assumptions are not correct.
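A minimal sketch of what such a guard could look like follows. The enum `TxOperation`, the class name, and the returned action strings are hypothetical and only for illustration; the BEGIN/COMMIT mapping mirrors the new behavior described in this PR. The sketch throws an explicit exception instead of using a Java `assert`, since `assert` statements are skipped unless the JVM runs with `-ea`.

```java
/** Hypothetical illustration of failing fast on an unexpected transactional message type. */
class TransactionalMessageGuardSketch {

    /** Stand-in for the operation types; OTHER represents any type we do not expect. */
    enum TxOperation { BEGIN, COMMIT, OTHER }

    /** Returns the name of the offset action to take for a transactional message. */
    static String offsetActionFor(TxOperation operation) {
        if (operation == TxOperation.BEGIN) {
            return "rotateVgtid";
        }
        else if (operation == TxOperation.COMMIT) {
            return "resetVgtid";
        }
        else {
            // Our assumption (only BEGIN/COMMIT exist) was wrong; surface it loudly.
            throw new IllegalStateException("Unexpected transactional operation: " + operation);
        }
    }

    public static void main(String[] args) {
        System.out.println(offsetActionFor(TxOperation.BEGIN));
        System.out.println(offsetActionFor(TxOperation.COMMIT));
    }
}
```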

HenryCaiHaiying · 2025-01-23T05:47:43Z

src/main/java/io/debezium/connector/vitess/VitessStreamingChangeEventSource.java

@@ -154,10 +163,6 @@ else if (message.getOperation().equals(ReplicationMessage.Operation.HEARTBEAT))

                offsetContext.event(tableId, message.getCommitTime());
                offsetContext.setShard(message.getShard());
-                if (isLastRowOfTransaction) {
-                    // Right before processing the last row, reset the previous offset to the new vgtid so the last row has the new vgtid as offset.
-                    offsetContext.resetVgtid(newVgtid, message.getCommitTime());


So we lost this resetVgtid() call in the new code. I guess you are assuming the COMMIT message will follow later; does the COMMIT message always happen? Also, in the old code the last row has the new vgtid, and it looks like this won't be the case in the new code. Are there any implications of this behavior change?

does COMMIT message always happens?

VitessReplicationConnection ensures that we always receive a COMMIT after a BEGIN (otherwise it throws an error). There is one vstream copy edge case where duplicate BEGINs can be received, but it discards those events, so this code doesn't need to handle them.

Also from the old commit, it seems the last row has the new vgtid, looks like this won't be the same in the new code, any implications on this behavior change?

Yes, this is the key part that leads to the data loss bug.

Let the current transaction VGTID be n, previous is n-1, next transaction is n+1

Previous behavior:

BEGIN - rotateVgtid - set currentVgtid to n, restartVgtid is n-1

INSERT/UPDATE/DELETE not last operation - no-op on offset context

INSERT/UPDATE/DELETE that is the last operation - resetVgtid - sets both currentVgtid and restartVgtid to n. Setting restartVgtid to n is wrong, since the last operation may still have multiple messages to send in the case of a primary key update (delete, tombstone, create), and the connector can fail partway through them.

COMMIT - rotateVgtid - only acts if the newVgtid differs from the currentVgtid; since the newVgtid is still n and the currentVgtid has already been set to n, this is a no-op.

New behavior:

BEGIN - rotateVgtid - set currentVgtid to n, restartVgtid is n-1

INSERT/UPDATE/DELETE in any order, last or not - no-op on offset context

COMMIT - resetVgtid - set current & restart VGTIDs to n. The offset will only be committed when the commit event is successfully produced (if tx metadata is enabled), when a write for n+1 is received (since the offset vgtid, i.e. the restartVgtid, will point to n), or when a heartbeat event is sent (with no tx metadata and no subsequent writes, that heartbeat will have its offset / restartVgtid set to n).
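The two timelines can be compared with a small simulation. This is a sketch under stated assumptions, not the connector's implementation: `SketchOffsetContext` and `OffsetTimelineSketch` are hypothetical classes, the rotate/reset semantics are taken from the description above, and each emitted record is assumed to carry the restartVgtid in effect at the moment it is emitted.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical, simplified stand-in for the offset context; not the real class. */
class SketchOffsetContext {
    String currentVgtid;
    String restartVgtid;

    SketchOffsetContext(String initialVgtid) {
        this.currentVgtid = initialVgtid;
        this.restartVgtid = initialVgtid;
    }

    /** Shifts current into restart, but only when the vgtid actually changes. */
    void rotateVgtid(String newVgtid) {
        if (!newVgtid.equals(currentVgtid)) {
            restartVgtid = currentVgtid;
            currentVgtid = newVgtid;
        }
    }

    /** Points both current and restart at the given vgtid. */
    void resetVgtid(String newVgtid) {
        currentVgtid = newVgtid;
        restartVgtid = newVgtid;
    }
}

class OffsetTimelineSketch {

    /** Replays transaction n, whose last row is a primary key update, and records each offset. */
    static List<String> offsetsForPrimaryKeyUpdate(boolean oldBehavior) {
        SketchOffsetContext ctx = new SketchOffsetContext("n-1");
        List<String> recordOffsets = new ArrayList<>();

        ctx.rotateVgtid("n"); // BEGIN

        if (oldBehavior) {
            ctx.resetVgtid("n"); // old code: reset right before the last row
        }
        // A primary key update is emitted as three separate records.
        String[] records = { "delete", "tombstone", "create" };
        for (String record : records) {
            recordOffsets.add(record + "@" + ctx.restartVgtid);
        }

        if (oldBehavior) {
            ctx.rotateVgtid("n"); // COMMIT in old code: no-op, vgtid is already n
        }
        else {
            ctx.resetVgtid("n"); // COMMIT in new code: transaction n is now the restart point
        }
        recordOffsets.add("after-commit@" + ctx.restartVgtid);
        return recordOffsets;
    }

    public static void main(String[] args) {
        System.out.println("old: " + offsetsForPrimaryKeyUpdate(true));
        System.out.println("new: " + offsetsForPrimaryKeyUpdate(false));
    }
}
```

In the old timeline the delete record already carries offset n, so a failure right after it leaves a restart point that skips the rest of transaction n. In the new timeline all three records carry n-1, so a restart replays transaction n.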

Sounds good

DBZ-8594 Prevent data loss when primary key update is last operation in transaction (803f85a)

HenryCaiHaiying reviewed Jan 22, 2025

HenryCaiHaiying reviewed Jan 23, 2025

twthorn requested a review from HenryCaiHaiying January 23, 2025 16:15
