Hotbackup Deadlock and Consistency fixes. #18928
Conversation
Review not yet complete, but already some comments, publishing for earlier visibility.
  if (!result.ok()) {
-   unlockDBServerTransactions(pool, backupId, lockedServers);
+   unlockServersTrxCommit(pool, backupId, lockedServers);
I know that this is not a new change of this PR, but what if a lock request is sent and runs into a timeout? Then this particular coordinator might still be locked, and we do not even try to roll back the lock here, since we are only contacting the lockedServers. Should we not, as in the cases further down, try to unlock all coordinators again?
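For illustration, a rough sketch of the safer rollback suggested here: contact every coordinator that was asked to lock, not only those that acknowledged it. All names (ClusterPool, unlockServersTrxCommit, allCoordinators) are placeholders loosely modelled on the hunk above, not the real signatures in the codebase.

```cpp
// Hypothetical sketch: unlock every coordinator that received a lock request,
// not only those in `lockedServers`. Types and helpers are stand-ins.
#include <iostream>
#include <string>
#include <vector>

struct ClusterPool {};  // stand-in for the real connection-pool type

// Stand-in for the real unlock fan-out; here it only prints the targets.
void unlockServersTrxCommit(ClusterPool& /*pool*/, std::string const& backupId,
                            std::vector<std::string> const& servers) {
  for (auto const& server : servers) {
    std::cout << "abort lock " << backupId << " on " << server << "\n";
  }
}

void rollbackCoordinatorLocks(ClusterPool& pool, std::string const& backupId,
                              std::vector<std::string> const& allCoordinators) {
  // Contact *every* coordinator that was sent a lock request: a request that
  // timed out on our side may still have succeeded remotely, so restricting
  // the rollback to the acknowledged servers could leave a coordinator locked.
  unlockServersTrxCommit(pool, backupId, allCoordinators);
}

int main() {
  ClusterPool pool;
  rollbackCoordinatorLocks(pool, "backup-123", {"CRDN-1", "CRDN-2", "CRDN-3"});
}
```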
I'm against changing this in this PR:
- This PR is already big enough, and it is not a new issue.
- The code in question is a huge mess. I am more likely to break it than to fix anything useful. It has to be reimplemented from the ground up.
- The existing logic is good enough for DBServers, so I would argue it is good enough for coordinators. There was probably a reason for doing it like that.
However, I'm not saying that this should never be changed. The hotbackup code (in particular the coordinator code) needs a rewrite, especially since the assumptions have changed over time, and chances are high we could come up with simpler code that works more reliably.
Can we address this in a BTS follow-up?
Maybe I've missed it, but do we test here that all ArangoSearch view documents are present (and nothing is duplicated) after restoring a hotbackup?
Apart from the things I have so far reported: LGTM. I will now move on to the enterprise part.
LGTM, the open point(s) can imo be addressed later
Afaik it was
Scope & Purpose
Before this PR, when the user requested a HotBackup, the ArangoDB cluster waited for all in-progress transactions to be committed. This led to deadlocks.
Also, the HotBackup snapshot of the RocksDB database only included a portion of the WAL and was not guaranteed to include enough of the WAL to recover MerkleTrees. This would lead to MerkleTrees being inconsistent with the database and to potential data loss from replication.
After this PR, requesting a HotBackup merely prevents transactions from being committed.
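As a minimal sketch of that idea, assuming a plain std::shared_mutex as the guard (the actual mechanism in ArangoDB may look different): transactions take the lock in shared mode only for the duration of their commit, while the HotBackup takes it exclusively around the snapshot. Running transactions are never waited for; they are only briefly prevented from committing.

```cpp
// Sketch only: block commits during a HotBackup instead of waiting for all
// transactions to finish. The guard type and method names are assumptions.
#include <mutex>
#include <shared_mutex>

class CommitGuard {
 public:
  // Called by every transaction right before it commits. Many transactions
  // can commit concurrently; they only exclude the HotBackup.
  void commitTransaction(/* Transaction& trx */) {
    std::shared_lock<std::shared_mutex> lock(_commitLock);
    // ... write the commit marker / apply the commit here ...
  }

  // Called by the HotBackup: while `takeSnapshot` runs, no commit can start,
  // but already running transactions keep executing, so nothing deadlocks
  // waiting for them to complete.
  template <typename SnapshotFn>
  void withCommitsBlocked(SnapshotFn&& takeSnapshot) {
    std::unique_lock<std::shared_mutex> lock(_commitLock);
    takeSnapshot();
  }

 private:
  std::shared_mutex _commitLock;
};
```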
The HotBackup process has to take two snapshots: One of the RocksDB store, and one of the ArangoSearch store.
Consistency is achieved by taking a snapshot of RocksDB with a large enough portion of the WAL so that the recovery process after the DBServer restart can replay all necessary operations on ArangoSearch and MerkleTrees.
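For context, a rough sketch of how a RocksDB snapshot that keeps the WAL can be taken with the vanilla rocksdb::Checkpoint API: passing a very large log_size_for_flush makes RocksDB link/copy the live WAL files instead of flushing memtables. The PR states that this alone was not sufficient and that a small patch to the RocksDB fork is needed (see below), so this only illustrates the general mechanism, not the patched behaviour.

```cpp
// Sketch, using the stock rocksdb::Checkpoint API: create a checkpoint that
// prefers copying the WAL over flushing memtables, so recovery after a
// restore can replay those operations (e.g. for ArangoSearch and MerkleTrees).
#include <cstdint>
#include <limits>
#include <memory>
#include <string>

#include <rocksdb/db.h>
#include <rocksdb/utilities/checkpoint.h>

rocksdb::Status createHotBackupSnapshot(rocksdb::DB* db,
                                        std::string const& targetDir) {
  rocksdb::Checkpoint* raw = nullptr;
  rocksdb::Status s = rocksdb::Checkpoint::Create(db, &raw);
  if (!s.ok()) {
    return s;
  }
  std::unique_ptr<rocksdb::Checkpoint> checkpoint(raw);

  // A log_size_for_flush larger than the current WAL size tells RocksDB to
  // include the live WAL files in the checkpoint instead of flushing.
  uint64_t sequenceNumber = 0;
  return checkpoint->CreateCheckpoint(
      targetDir, std::numeric_limits<uint64_t>::max(), &sequenceNumber);
}
```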
Unfortunately, taking a snapshot of RocksDB that includes the WAL files requires a small patch to our RocksDB fork. Future work may include upstreaming that patch.
Enterprise: https://github.com/arangodb/enterprise/pull/1267