test: Implicitly sync after generate* to preempt races and intermittent test failures #22567

maflcko · 2021-07-28T09:30:05Z

The most frequent failures in functional tests are intermittent races. Fixing such bugs is cumbersome because it involves:

* Noticing the failure
* Fetching and reading the log to determine the test case that failed
* Adding a self.sync_all() where it was forgotten
* Spamming out a PR and waiting for review, which is already sparse

Also, writing a linter to catch those is not possible, nor is review effective in finding these bugs prior to merge.

Fix all future intermittent races caused by a missing sync_block call by calling sync_all implicitly after each generate*, unless opted out. This ensures that the code is race-free (with regard to blocks) when the tests pass once, instead of our current approach where the code can never be guaranteed to be race-free.
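The idea can be sketched in a few lines of Python. This is a toy model, not the actual test framework code: `FakeNode`, `MiniFramework`, and `no_op` are hypothetical names made up for illustration, and only the `sync_fun` parameter name is taken from the review discussion of this pull request.

```python
class FakeNode:
    """Hypothetical stand-in for a node: it only holds a list of block names."""

    def __init__(self):
        self.chain = []

    def mine(self, nblocks):
        start = len(self.chain)
        new_blocks = [f"block{start + i}" for i in range(nblocks)]
        self.chain.extend(new_blocks)
        return new_blocks


class MiniFramework:
    """Toy framework whose generate() syncs all nodes unless the caller opts out."""

    def __init__(self, num_nodes):
        self.nodes = [FakeNode() for _ in range(num_nodes)]

    def no_op(self):
        """Opt-out value for sync_fun: do nothing after generating."""

    def sync_all(self):
        # Toy sync: copy the longest chain to every node.
        # A real sync would instead wait until all nodes report the same tip.
        best = max((node.chain for node in self.nodes), key=len)
        for node in self.nodes:
            node.chain = list(best)

    def generate(self, node, nblocks, sync_fun=None):
        blocks = node.mine(nblocks)
        # The sync is implicit: it runs unless a different sync_fun is passed.
        (sync_fun or self.sync_all)()
        return blocks


fw = MiniFramework(num_nodes=2)
fw.generate(fw.nodes[0], 3)                      # synced implicitly
fw.generate(fw.nodes[0], 1, sync_fun=fw.no_op)   # explicit opt-out
```

After the first call both nodes hold the same three blocks; after the second call only the generating node has the fourth block, which is the situation a test has to ask for explicitly.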

maflcko · 2021-07-28T09:34:25Z

This is functionally identical to #20362 (comment), but implemented differently.

Here the sync happens in the test framework, which makes the diff larger. In the other pull the sync happens in the test node.

DrahtBot · 2021-07-28T10:02:33Z

The following sections might be updated with supplementary metadata relevant to reviewers and maintainers.

Conflicts

Reviewers, this pull request conflicts with the following ones:

#23371 (test: MiniWallet: add P2TR support and use it per default by theStack)
#23365 (index: Fix backwards search for bestblock by mzumsande)
#23127 (tests: Use test framework utils where possible by vincenzopalazzo)
#23075 (refactoring: Fee estimation functional test cleanups by darosior)
#22364 (wallet: Make a tr() descriptor by default [DO NOT MERGE UNTIL TAPROOT ACTIVATES] by achow101)
#21726 (Improve Indices on pruned nodes via prune blockers by fjahr)
#21283 (Implement BIP 370 PSBTv2 by achow101)
#19461 (multiprocess: Add bitcoin-gui -ipcconnect option by ryanofsky)
#19460 (multiprocess: Add bitcoin-wallet -ipcconnect option by ryanofsky)
#10102 (Multiprocess bitcoin by ryanofsky)

If you consider this pull request important, please also help to review the conflicting pull requests. Ideally, start with the one that should be merged first.

practicalswift · 2021-07-29T09:05:45Z

Strong Concept ACK

Intermittent test failures are the worst!

rajarshimaitra

Strong concept ACK.

But I am getting certain failures when I run all the tests together with test/functional/test_runner.py -j 10.

I am not sure if that's something to do with this PR or it's some issue at my end.

They are also happening in master.
Failed tests are passing when I run them independently.

Failure summary :

wallet_watchonly.py --legacy-wallet                | ✓ Passed  | 22 s
wallet_watchonly.py --usecli --legacy-wallet       | ✓ Passed  | 25 s
feature_backwards_compatibility.py --descriptors   | ○ Skipped | 0 s
feature_backwards_compatibility.py --legacy-wallet | ○ Skipped | 0 s
feature_taproot.py --previous_release              | ○ Skipped | 0 s
mempool_compatibility.py                           | ○ Skipped | 0 s
wallet_upgradewallet.py --legacy-wallet            | ○ Skipped | 0 s
feature_coinstatsindex.py                          | ✖ Failed  | 68 s
feature_config_args.py                             | ✖ Failed  | 28 s
rpc_invalidateblock.py                             | ✖ Failed  | 63 s
wallet_backup.py --legacy-wallet                   | ✖ Failed  | 317 s
wallet_dump.py --legacy-wallet                     | ✖ Failed  | 168 s

One sample failure log:

202/215 - feature_config_args.py failed, Duration: 28 s

stdout:
2021-07-29T13:11:46.329000Z TestFramework (INFO): Initializing test directory /tmp/test_runner_₿_🏃_20210729_182102/feature_config_args_9
2021-07-29T13:12:00.419000Z TestFramework (INFO): Test config args logging
2021-07-29T13:12:02.455000Z TestFramework (INFO): Test seed peers
2021-07-29T13:12:12.318000Z TestFramework (ERROR): Assertion failed
Traceback (most recent call last):
  File "/home/raj/github-repo/mybitcoin/bitcoin/test/functional/test_framework/test_framework.py", line 128, in main
    self.run_test()
  File "/home/raj/github-repo/mybitcoin/bitcoin/test/functional/feature_config_args.py", line 221, in run_test
    self.test_seed_peers()
  File "/home/raj/github-repo/mybitcoin/bitcoin/test/functional/feature_config_args.py", line 186, in test_seed_peers
    self.start_node(0, extra_args=['-dnsseed=0', '-fixedseeds=1'])
  File "/usr/lib/python3.8/contextlib.py", line 120, in __exit__
    next(self.gen)
  File "/home/raj/github-repo/mybitcoin/bitcoin/test/functional/test_framework/test_node.py", line 408, in assert_debug_log
    self._raise_assertion_error('Expected messages "{}" does not partially match log:\n\n{}\n\n'.format(str(expected_msgs), print_log))
  File "/home/raj/github-repo/mybitcoin/bitcoin/test/functional/test_framework/test_node.py", line 166, in _raise_assertion_error
    raise AssertionError(self._node_msg(msg))
AssertionError: [node 0] Expected messages "['Loaded 0 addresses from peers.dat', 'DNS seeding disabled', 'Adding fixed seeds as -dnsseed=0, -addnode is not provided and all -seednode(s) attempted\n']" does not partially match log:

tryphe · 2021-07-30T04:08:56Z

Concept ACK

Is it correct to say the tests will be more upfront in catching syncing issues because, other than being race-free, the syncing happens within the test framework? Versus #20362. Just trying to understand the tradeoffs here despite the bigger diff.

Either way, I'm all in favor of merging this one ASAP to avoid an impending avalanche of conflicting PRs :)

maflcko · 2021-07-30T10:07:13Z

> I am not sure if that's something to do with this PR or it's some issue at my end.
> They are also happening in master.
> Failed tests are passing when I run them independently.

If the tests also fail on master, then it can't be due to this PR. You can file an issue and hope someone fixes it, or fix it yourself.

> Is it correct to say the tests will be more upfront in catching syncing issues because, other than being race-free, the syncing happens within the test framework? Versus #20362.

As explained in the first comment (#22567 (comment)), the two are supposed to be functionally identical.

jonatack · 2021-09-04T15:39:03Z

Maybe name the sync_fun param simply sync; it seems just as clear (people will see what type of values are passed) and is shorter.

Looks like two tests need updated copyright headers; the scripted-diff check fails in the last commit 09fbdc3:

$ test/lint/commit-script-check.sh f4e12fd..09fbdc3

diff --git a/test/functional/rpc_signmessagewithprivkey.py b/test/functional/rpc_signmessagewithprivkey.py
index 27aee44d25..80555eab75 100755
--- a/test/functional/rpc_signmessagewithprivkey.py
+++ b/test/functional/rpc_signmessagewithprivkey.py
@@ -1,5 +1,5 @@
 #!/usr/bin/env python3
-# Copyright (c) 2016-2020 The Bitcoin Core developers
+# Copyright (c) 2016-2021 The Bitcoin Core developers
 # Distributed under the MIT software license, see the accompanying
 # file COPYING or http://www.opensource.org/licenses/mit-license.php.
 """Test RPC commands for signing messages with private key."""
diff --git a/test/functional/wallet_signmessagewithaddress.py b/test/functional/wallet_signmessagewithaddress.py
index bf6f95e3f1..74a8f2eef2 100755
--- a/test/functional/wallet_signmessagewithaddress.py
+++ b/test/functional/wallet_signmessagewithaddress.py
@@ -1,5 +1,5 @@
 #!/usr/bin/env python3
-# Copyright (c) 2016-2019 The Bitcoin Core developers
+# Copyright (c) 2016-2021 The Bitcoin Core developers
 # Distributed under the MIT software license, see the accompanying
 # file COPYING or http://www.opensource.org/licenses/mit-license.php.
 """Test Wallet commands for signing and verifying messages."""
Failed

fa0b916 scripted-diff: Use generate* from TestFramework (MarcoFalke)

Pull request description:

This is needed for #22567. By using the newly added `generate*` member functions of the test framework, it paves the way to make it easier to implicitly call `sync_all` after block generation to avoid intermittent issues.

ACKs for top commit:
jonatack: ACK fa0b916

Tree-SHA512: e74a324b60250a87c08847cdfd7b6ce3e1d89b891659fd168f6dd7dc0aa718d0edd28285374a613f462f34f4ef8e12c90ad44fb58721c91b2ea691406ad22c2a

fac62e6 test: Delete generate* calls from TestNode (MarcoFalke)
fac7f61 test: Use generate* node RPC, not wallet RPC (MarcoFalke)
faac1cd test: Use generate* from TestFramework, not TestNode (MarcoFalke)

Pull request description:

Deleting the methods is needed for #22567 to pave the way to make it easier to implicitly call the `sync_all` member function. Without the methods being deleted, nothing prevents developers from adding calls to it. As history showed, developers *will* add calls to it. For example, see commit eb02dbb from today or the first commit in this pull request.

ACKs for top commit:
stratospher: Tested ACK fac62e6.
brunoerg: tACK fac62e6
promag: Code review ACK fac62e6.

Tree-SHA512: 6d4dea8f95ead954acfef2e6a5d98897ce0c2d02265c5b137bb149d0265543bd51d7e8403e1945b9af75df5524ca50064fe1d2a432b25c8abc71bbb28ed6ed53
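One way to picture "nothing prevents developers from adding calls to it" is a node object that rejects direct calls and only accepts them from the framework wrapper. The sketch below is illustrative only: `GuardedNode`, `Framework`, and the `from_framework` keyword are hypothetical names and do not describe how the commits above are implemented.

```python
class GuardedNode:
    """Hypothetical node wrapper that refuses direct generate() calls."""

    def _rpc(self, method, *args):
        # Stand-in for a real RPC call: just report what would be sent.
        return (method, args)

    def generate(self, *args, from_framework=False):
        if not from_framework:
            raise AssertionError("use the framework-level generate(), which also syncs")
        return self._rpc("generate", *args)


class Framework:
    """Hypothetical framework: the only supported entry point for generating blocks."""

    def __init__(self):
        self.node = GuardedNode()
        self.synced = False

    def generate(self, node, nblocks):
        result = node.generate(nblocks, from_framework=True)
        self.synced = True  # placeholder for the implicit sync step
        return result


fw = Framework()
fw.generate(fw.node, 2)      # allowed: goes through the framework
try:
    fw.node.generate(2)      # rejected: bypasses the framework
except AssertionError as err:
    print("rejected:", err)
```

With a guard like this, a forgotten sync cannot be reintroduced silently, because the unsynced code path fails immediately instead of intermittently.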

…nless opted out

facc352 test: Implicitly sync after generate*, unless opted out (MarcoFalke)

Pull request description:

The most frequent failure in functional tests are intermittent races. Fixing such bugs is cumbersome because it involves:

* Noticing the failure
* Fetching and reading the log to determine the test case that failed
* Adding a `self.sync_all()` where it was forgotten
* Spamming out a pr and waiting for review, which is already sparse

Also, writing a linter to catch those is not possible, nor is review effective in finding these bugs prior to merge.

Fix all future intermittent races caused by a missing sync_block call by calling `sync_all` implicitly after each `generate*`, unless opted out. This ensures that the code is race-free (with regards to blocks) when the tests pass once, instead of our current approach where the code can never be guaranteed to be race-free.

There are some scripted-diff cleanups (see bitcoin/bitcoin#22567), but they will be submitted in a follow-up to reduce the conflicts in this pull.

ACKs for top commit:
lsilva01: tACK facc352 on Ubuntu 20.04
brunoerg: tACK facc352 on MacOS 11.6

Tree-SHA512: 046a40a066b4a3bd28a3077bd654fa8887442dd1f0ec6fd11671865809ef02376f126eb667a1320ebd67b6e372c78c00dbf8bd25d86ed86f1d9a25363103ed97

maflcko · 2021-11-09T09:47:31Z

I've archived the discussion here and created a follow-up for the remaining (scripted-diff) changes: #23474

> Maybe name the sync_fun param simply sync; it seems just as clear (people will see what type of values are passed) and is shorter.

Sounds acceptable. Should I add a scripted-diff for that to the other pull?

maflcko force-pushed the 2107-testSync branch from d0cd7a0 to faf7a45 Compare July 28, 2021 09:30

fanquake added the Tests label Jul 28, 2021

maflcko mentioned this pull request Jul 28, 2021

test: Implicitly sync after generate* to preempt races and intermittent test failures #20362

Closed

maflcko force-pushed the 2107-testSync branch from faf7a45 to 71dfdee Compare July 28, 2021 11:01

maflcko force-pushed the 2107-testSync branch from 71dfdee to fa0959c Compare July 28, 2021 12:57

DrahtBot added the Needs rebase label Jul 28, 2021

maflcko force-pushed the 2107-testSync branch from fa0959c to faf47c9 Compare July 28, 2021 15:00

DrahtBot removed the Needs rebase label Jul 28, 2021

This was referenced Jul 28, 2021

Implement BIP 370 PSBTv2 #21283

Open

rpc, test: Improve getblockstats for unspendables #19888

Merged

rajarshimaitra reviewed Jul 29, 2021

DrahtBot added the Needs rebase label Jul 30, 2021

maflcko force-pushed the 2107-testSync branch from 09fbdc3 to f2f5bea Compare September 9, 2021 12:26

This was referenced Sep 10, 2021

test: add addpeeraddress "tried", test addrman checks on restart with asmap #22831

Merged

Add Single Random Draw as an additional coin selection algorithm #17526

Merged

This was referenced Sep 21, 2021

test: Use MiniWallet in mempool_persist #23047

Merged

test: Fee estimation functional test cleanups #23075

Merged

Package-aware fee estimation #23074

Closed

jonatack mentioned this pull request Oct 5, 2021

Fix intermittent failure in wallet_send.py and rpc_fundrawtransaction.py #23200

Merged

maflcko force-pushed the 2107-testSync branch from f2f5bea to 322d994 Compare October 6, 2021 11:10

maflcko mentioned this pull request Oct 6, 2021

test: Delete generate* calls from TestNode #23207

Merged

MarcoFalke added 4 commits October 18, 2021 13:15

test: Implicitly sync after generate*, unless opted out

fa2e5a5

scripted-diff: Remove redundant sync_all

ffff4ad

-BEGIN VERIFY SCRIPT-
perl -0777 -pi -e 's/(generate[^\n]*\)[^\n]*)(\n|\s)+self.sync_.*\n/\1\n/g' $(git grep -l generate ./test)
-END VERIFY SCRIPT-
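To see what this substitution does to a test file, the same pattern can be tried in Python. This is a rough equivalent for illustration, not part of the pull request: `drop_redundant_sync` and the sample snippet are made up here, and the Perl `-0777` flag (read the whole file at once) corresponds to passing the complete file contents as one string.

```python
import re

# Same pattern as the Perl one-liner: a line containing a generate*(...) call,
# then newlines/whitespace, then a line that starts a self.sync_* call.
REDUNDANT_SYNC = re.compile(r'(generate[^\n]*\)[^\n]*)(\n|\s)+self.sync_.*\n')


def drop_redundant_sync(source: str) -> str:
    """Remove a self.sync_* line that directly follows a generate* call."""
    return REDUNDANT_SYNC.sub(r'\1\n', source)


before = (
    "        self.generate(self.nodes[0], 1)\n"
    "        self.sync_all()\n"
    "        assert_equal(self.nodes[1].getblockcount(), 1)\n"
)
after = drop_redundant_sync(before)
print(after)
```

The `self.sync_all()` line is removed because it directly follows a `generate` call; a sync call that is not preceded by a `generate` line is left untouched.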

test: Use 4 spaces for indentation

fa74543

scripted-diff: Bump copyright headers

fad9d45

The previous diff touched most files in ./test/, so bump the headers to avoid having to touch them again for a bump later.

-BEGIN VERIFY SCRIPT-
./contrib/devtools/copyright_header.py update ./test/
-END VERIFY SCRIPT-

maflcko force-pushed the 2107-testSync branch from 322d994 to fad9d45 Compare October 18, 2021 11:19

maflcko mentioned this pull request Oct 18, 2021

test: Implicitly sync after generate*, unless opted out #23300

Merged

This was referenced Oct 27, 2021

test: MiniWallet: add P2TR support and use it per default #23371

Closed

index: Fix backwards search for bestblock #23365

Merged

DrahtBot mentioned this pull request Nov 5, 2021

tests: Use test framework utils where possible #23127

Closed

maflcko marked this pull request as ready for review November 9, 2021 09:37

maflcko closed this Nov 9, 2021

maflcko deleted the 2107-testSync branch November 9, 2021 09:40

maflcko restored the 2107-testSync branch November 9, 2021 09:41

jnewbery mentioned this pull request May 27, 2022

CI timeout in test/functional/p2p_feefilter.py #22483

Closed

bitcoin locked and limited conversation to collaborators Nov 9, 2022
