Fix nextXid tracking bug on standbys (9.5-11 only).
author Thomas Munro <tmunro@postgresql.org>
Thu, 12 Mar 2020 05:06:54 +0000 (18:06 +1300)
committer Thomas Munro <tmunro@postgresql.org>
Thu, 12 Mar 2020 05:06:54 +0000 (18:06 +1300)
RecordKnownAssignedTransactionIds() should never move
nextXid backwards.  Before this commit, that could happen
if some other code path had advanced it without advancing
latestObservedXid.

One consequence is that a well-timed XLOG_CHECKPOINT_ONLINE
could cause hot standby feedback messages to get confused
and report an xmin from a future epoch, potentially allowing
vacuum to run too soon on the primary.

Repair by making sure RecordKnownAssignedTransactionIds()
can only move nextXid forwards.

In release 12 and master, this was already done by commit
2fc7af5e, which consolidated similar code and straightened
out this bug.  Back-patch to supported releases before that.

Author: Eka Palamadai <ekanatha@amazon.com>
Discussion: https://postgr.es/m/98BB4805-D0A2-48E1-96F4-15014313EADC@amazon.com

src/backend/storage/ipc/procarray.c

index ddd3461d56eb9ac7453846cb1dacd119653d0a38..50b9aea6838736f250ab5f5b95c94b93d4fc98bb 100644
@@ -3256,7 +3256,8 @@ RecordKnownAssignedTransactionIds(TransactionId xid)
        next_expected_xid = latestObservedXid;
        TransactionIdAdvance(next_expected_xid);
        LWLockAcquire(XidGenLock, LW_EXCLUSIVE);
-       ShmemVariableCache->nextXid = next_expected_xid;
+       if (TransactionIdFollows(next_expected_xid, ShmemVariableCache->nextXid))
+           ShmemVariableCache->nextXid = next_expected_xid;
        LWLockRelease(XidGenLock);
    }
 }
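
The forward-only guard in the hunk above can be sketched in isolation. This is a minimal standalone illustration, not PostgreSQL source: the names xid_t, FIRST_NORMAL_XID, xid_follows(), xid_advance() and advance_next_xid() are hypothetical stand-ins that are assumed to mirror the behavior of TransactionId, FirstNormalTransactionId, TransactionIdFollows() and TransactionIdAdvance(), and the locking around ShmemVariableCache is left out entirely.

```c
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical stand-in for TransactionId: a 32-bit counter that wraps. */
typedef uint32_t xid_t;

/* Assumption: values below 3 are reserved "special" XIDs, as in PostgreSQL. */
#define FIRST_NORMAL_XID ((xid_t) 3)

/*
 * Wraparound-aware "a is logically later than b".
 * For normal XIDs the comparison is done modulo 2^32 by looking at the
 * sign of the 32-bit difference; special XIDs compare as plain integers.
 */
static bool
xid_follows(xid_t a, xid_t b)
{
    if (a < FIRST_NORMAL_XID || b < FIRST_NORMAL_XID)
        return a > b;
    return (int32_t) (a - b) > 0;
}

/* Step to the next XID, skipping the reserved values after wraparound. */
static xid_t
xid_advance(xid_t xid)
{
    xid++;
    if (xid < FIRST_NORMAL_XID)
        xid = FIRST_NORMAL_XID;
    return xid;
}

/*
 * Return the new value for next_xid given the latest observed XID.
 * The old behavior was an unconditional assignment of next_expected;
 * the guarded form only ever moves next_xid forwards.
 */
static xid_t
advance_next_xid(xid_t next_xid, xid_t latest_observed)
{
    xid_t next_expected = xid_advance(latest_observed);

    if (xid_follows(next_expected, next_xid))
        return next_expected;
    return next_xid;            /* already ahead: leave it alone */
}
```

With these definitions, advance_next_xid(1000, 500) leaves the counter at 1000, whereas an unconditional assignment would have moved it back to 501. Because the comparison is modular, advance_next_xid(5, 4294967290u) also keeps 5: a counter that has already wrapped is treated as being ahead of an observed XID from just before the wrap.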