Refactor BnB tests by murchandamus · Pull Request #29532 · bitcoin/bitcoin

murchandamus · 2024-03-01T22:37:48Z

This PR splits off some of the improvements made in #28985 and starts addressing the issues raised in #27754.

I aim to completely replace coinselector_tests with coinselection_tests. The goal is to generally create coins per a nominal effective value, so that we can get away from testing with non-representative CoinSelectionParams that cause counterintuitive behavior, such as feerate = 0 or cost_of_change = 0.

DrahtBot · 2024-03-01T22:37:51Z

The following sections might be updated with supplementary metadata relevant to reviewers and maintainers.

Code Coverage & Benchmarks

For details see: https://corecheck.dev/bitcoin/bitcoin/pulls/29532.

Reviews

See the guideline for information on the review process.

Type	Reviewers
ACK	achow101, monlovesmango, w0xlt
Concept ACK	jasonribble, ismaelsadeeq, yancyribbens

If your review is incorrectly listed, please react with 👎 to this comment and the bot will ignore it on the next update.

Conflicts

No conflicts as of last run.

murchandamus · 2024-04-09T12:03:08Z

Pinging @furszy, @achow101, @S3RK, as discussed

src/wallet/test/coinselection_tests.cpp

furszy

I'm not completely sure about 584e524eb57444d7192df1049cafde9ccc480406. The commit description says

Originally these tests verified that at a SelectCoins level that a
solution with fewer inputs gets preferred at high feerates, and a
solution with more inputs gets preferred at low feerates. This outcome
relies on the behavior of BnB, so we move these tests under the umbrella
of BnB tests.

It is true that the outcome relies only on the BnB behavior currently, but that might not be true in the future. There could be another algorithm clashing with it.

src/wallet/test/coinselection_tests.cpp

murchandamus · 2024-06-27T17:36:33Z

Alright, should hopefully be ready to review.

murchandamus · 2024-06-27T17:44:18Z

I'm not completely sure about 584e524. The commit description says

Originally these tests verified that at a SelectCoins level that a
solution with fewer inputs gets preferred at high feerates, and a
solution with more inputs gets preferred at low feerates. This outcome
relies on the behavior of BnB, so we move these tests under the umbrella
of BnB tests.

It is true that the outcome relies only on the BnB behavior currently, but that might not be true in the future. There could be another algorithm clashing with it.

Yeah, so the old tests assumed that because BnB behaved a certain way, we would get a specific overall outcome. The new tests just check that BnB behaves a certain way. We might still want tests that check the overall outcome resulting from the interaction of multiple coin selection algorithms, but this one clearly seemed to be testing BnB behavior, and it seemed strange to me to test that at the level where the results are combined rather than checking that BnB assumptions are fulfilled by BnB.

DrahtBot · 2024-09-11T11:47:53Z

🚧 At least one of the CI tasks failed.
Debug: https://github.com/bitcoin/bitcoin/runs/29957535397

Hints

Make sure to run all tests locally, according to the documentation.

The failure may happen due to a number of reasons, for example:

Possibly due to a silent merge conflict (the changes in this pull request being incompatible with the current code in the target branch). If so, make sure to rebase on the latest commit of the target branch.
A sanitizer issue, which can only be found by compiling with the sanitizer and running the affected test.
An intermittent issue.

Leave a comment here, if you need help tracking down a confusing failure.

During the selection process, input values are converted to effective_values, and the search then runs on the calculated effective_values. In the test suite, it is often desirable to test what happens for a particular effective_value. Instead of manually figuring out which absolute_value results in a given effective_value for the search routine, add the ability to denote a UTXO by its effective_value in the test suite. This is done by wrapping the value with e(); otherwise it is assumed to be an absolute value.

For example:

```
weighted_utxos: &["1 sat/68vb", "e(1 sat)/68 vb", "e(1 sat)/204 wu"]
```

This will evaluate to two `UTXOs` with equivalent effective_values of 1 sat even though they have different sizes (`68 vb` and `204 wu`). There will also be a `UTXO` with a negative effective value, since its absolute value is given as `1 sat/68 vb`.

Computing the `Utxo` requires knowing the `fee_rate`, and the `fee_rate` is given in the `fee_rate` field, not as part of `weighted_utxos`. Therefore, it is no longer useful to use `From::str`, since a `Utxo` can no longer be constructed solely from the string value in `weighted_utxos`.

Motivated by: bitcoin/bitcoin#29532

monlovesmango · 2025-04-16T18:09:45Z

As discussed during Bitcoin Core Review Club, at least one test with 0 fee rate should probably be added to avoid test coverage regression.

Thanks for hosting Murch!

src/wallet/test/CMakeLists.txt

src/wallet/test/coinselection_tests.cpp

w0xlt

nit: Perhaps the commit descriptions ("BnB rate sensitivity tests", "simple BnB failure tests", and "BnB iteration exhaustion test", for example) could become functions for clarity (not necessarily new test cases).

w0xlt

nit: Some commits can be squashed to avoid warnings like "unused function".

murchandamus · 2025-04-29T02:30:06Z

nit: Perhaps the commit descriptions ("BnB rate sensitivity tests", "simple BnB failure tests", and "BnB iteration exhaustion test", for example) could become functions for clarity (not necessarily new test cases).

I’m not sure I understand what you mean. Do you mean that I introduce helper functions like "TestBnBSuccess" for the new tests instead of making them their own test cases?

murchandamus · 2025-04-29T03:55:20Z

nit: Some commits can be squashed to avoid warnings like "unused function".

@w0xlt: I ran each commit, and I did not receive this warning. Could you let me know which commit resulted in that warning for you?

As discussed during Bitcoin Core Review Club, at least one test with 0 fee rate should probably be added to avoid test coverage regression.

@monlovesmango: I added another commit to run the simple tests at various feerates, including the feerates 0 sat/kvB, 1 sat/kvB, and 1,500,000 sat/kvB.

murchandamus · 2025-04-29T05:10:01Z

Rebased due to strange CI failures

Edit:
It seems to me that the failure of wallet_backwards_compatibility.py got addressed via "test: Use the correct node for doubled keypath test" (#32369), but I don’t see an issue or pull request regarding p2p_i2p_ports.py. Anyone have an idea?

achow101

I think the first 2 commits could be combined. The first commit is harder to review since it introduces a bunch of helpers that are completely unused. In fact, the file is introduced but not to the build system, so we can't even check if it compiles.

src/wallet/test/coinselection_tests.cpp

w0xlt

ACK dbf1f26

I ran each commit, and I did not receive this warning. Could you let me know which commit resulted in that warning for you?

It was caused by the coinselection_tests.cpp as you mentioned here: #29532 (comment)

Do you mean that I introduce helper functions like "TestBnBSuccess" for the new tests instead of making them their own test cases?

I was referring to something like the code below (which encapsulates the latest commit changes), but it was just a suggestion. The current code looks good to me too.

void iteration_exhaustion_test(...) {
    std::vector<OutputGroup> doppelganger_pool;
    std::vector<CAmount> doppelgangers;
    std::vector<CAmount> expected_inputs;
    for (int i = 0; i < 17; ++i) {
        // ...
    }
    AddCoins(doppelganger_pool, doppelgangers);
    // Among up to 17 unique UTXOs of similar effective value we will find a solution composed of the eight smallest UTXOs
    TestBnBSuccess("Combine smallest 8 of 17 unique UTXOs", doppelganger_pool, /*selection_target=*/8 * CENT, /*expected_input_amounts=*/expected_inputs);

    // Starting with 18 unique UTXOs of similar effective value we will not find the solution due to exceeding the attempt limit
    AddCoins(doppelganger_pool, {1 * CENT + default_cs_params.m_cost_of_change + 17});
    TestBnBFail("Exhaust looking for smallest 8 of 18 unique UTXOs", doppelganger_pool, /*selection_target=*/8 * CENT);
}

BOOST_AUTO_TEST_CASE(bnb_test) {
    // ...
    iteration_exhaustion_test(...);
}

Originally these tests verified that at a SelectCoins level that a solution with fewer inputs gets preferred at high feerates, and a solution with more inputs gets preferred at low feerates. This outcome relies on the behavior of BnB, so we move these tests under the umbrella of BnB tests. Originally these tests relied on SFFO to work.

monlovesmango

ACK dbf1f2663b1afbe03d6b1855f83db604bc79979e

I like how it now covers a variety of fee rates.

monlovesmango · 2025-05-01T02:44:59Z

src/wallet/test/coinselection_tests.cpp

Nit: this seems to be checking for a match, not a mismatch

Suggested change

BOOST_CHECK_MESSAGE(HaveEquivalentValues(expected_result, *result), strprintf("Result mismatch in BnB-Success: %s. Expected %s, but got %s", test_title, InputAmountsToString(expected_result), InputAmountsToString(*result)));

BOOST_CHECK_MESSAGE(HaveEquivalentValues(expected_result, *result), strprintf("Result match in BnB-Success: %s. Expected %s, and got %s", test_title, InputAmountsToString(expected_result), InputAmountsToString(*result)));

Right, it checks whether the expected result and the selected input set match, but the message here is printed in the case of a failure!

Ok, I see! I understand now. Sorry, I was running with --log_level=all and was misinterpreting these as success messages. Please disregard.

Thanks for taking such a thorough look!

src/wallet/test/coinselection_tests.cpp

murchandamus · 2025-05-01T19:52:22Z

I was referring to something like the code below (which encapsulates the latest commit changes), but it was just a suggestion. The current code looks good to me too.

I see, thanks. I guess it could be nice to be able to run the test suite in smaller portions especially if some of the tests took a long time, but it overall runs extremely quickly, so I’m not sure it is necessary to further structure the tests at this time.

I intend to work on porting the other algorithms’ tests in the same manner, and I can pick this up in a follow-up commit, if we decide that’s the way to go then.

murchandamus · 2025-05-01T19:52:42Z

Should be ready to review, again.

achow101 · 2025-05-01T23:40:27Z

ACK 85368aa

monlovesmango · 2025-05-02T00:20:16Z

ACK 85368aa

furszy · 2025-05-02T19:56:28Z

src/wallet/test/coinselection_tests.cpp

+/** Make one OutputGroup with a single UTXO that either has a given effective value (default) or a given amount (`is_eff_value = false`). */
+static OutputGroup MakeCoin(const CAmount& amount, bool is_eff_value = true, CoinSelectionParams cs_params = default_cs_params, int custom_spending_vsize = 68)


Since the is_eff_value arg is always true in all the tests, what about removing it?
Also, you are expecting group.GetSelectionAmount() to be equal to tx.vout[0].nValue with this parameter, right? I'm unsure that will always be the case.
As a test, you could try adding an assertion at the end of this function. I have a hunch it will crash.

I believe the plan is to rewrite all of the tests into this style over the course of a couple PRs. So this parameter is here to allow for the future testing of SFFO behavior.

Also, you are expecting group.GetSelectionAmount() to be equal to tx.vout[0].nValue with this parameter right?

Only when it is false. The purpose is to have group.GetSelectionAmount() be equal to amount, so nValue is increased by the amount in fees so that they match after fees are deducted.

w0xlt

ACK 85368aa

DrahtBot mentioned this pull request Mar 2, 2024

test: Add algo assert to bnb_search_test #29206

Closed

murchandamus force-pushed the 2024-03-coinselection_tests branch from d8ccf02 to 5d722bb Compare March 5, 2024 13:50

DrahtBot added CI failed and removed CI failed labels Mar 23, 2024

murchandamus marked this pull request as ready for review April 9, 2024 12:02

DrahtBot added CI failed and removed CI failed labels Apr 19, 2024

achow101 reviewed Apr 24, 2024

furszy reviewed May 3, 2024

murchandamus mentioned this pull request Jun 7, 2024

Avoid changeless input sets when SFFO is active #28985

Closed

murchandamus force-pushed the 2024-03-coinselection_tests branch from 5d722bb to 504873c Compare June 27, 2024 15:57

murchandamus commented Jun 27, 2024

murchandamus force-pushed the 2024-03-coinselection_tests branch 2 times, most recently from 0bece89 to cbeb10b Compare June 27, 2024 17:35

DrahtBot mentioned this pull request Aug 16, 2024

build: Remove Autotools-based build system #30664

Merged

hebasto added Tests Needs CMake port labels Aug 16, 2024

DrahtBot added the Needs rebase label Sep 2, 2024

maflcko removed the Needs CMake port label Sep 3, 2024

murchandamus force-pushed the 2024-03-coinselection_tests branch from cbeb10b to 131bbd1 Compare September 10, 2024 20:32

DrahtBot removed the Needs rebase label Sep 10, 2024

DrahtBot added the CI failed label Sep 11, 2024

murchandamus force-pushed the 2024-03-coinselection_tests branch from 131bbd1 to 639a0dd Compare September 11, 2024 19:48

This was referenced Sep 12, 2024

In 3dff8f0490b4b92c4adf229a828063dfda6d3a80 "[test] Create coinselection_tests" Lepart23/cautious-octo-fiesta#2

Open

Thanks, fixed Lepart23/cautious-octo-fiesta#3

Open

w0xlt reviewed Apr 17, 2025

w0xlt reviewed Apr 17, 2025

achow101 reviewed Apr 30, 2025

w0xlt reviewed Apr 30, 2025

murchandamus added 7 commits April 30, 2025 15:37

test: Recreate simple BnB success tests

2bafc46

Recreates the tests in a new test suite coinselection_tests.cpp that is based on UTXOs being created per their effective values rather than nominal values and uses transactions with non-zero feerates.

test: Recreate BnB clone skipping test

a94030a

test: Recreate simple BnB failure tests

4781f5c

test: Remove redundant repeated test

2a1b275

We do not need to repeat the same test multiple times because BnB is deterministic and will therefore always have the same outcome. Additionally, this test was redundant because it repeats the "Smallest combination too big" test.

test: Recreate BnB iteration exhaustion test

d610951

test: Run simple tests at various feerates

85368aa

monlovesmango reviewed May 1, 2025

furszy reviewed May 2, 2025

w0xlt reviewed May 2, 2025

murchandamus mentioned this pull request Aug 6, 2025

Slow unit tests delay functional tests and leave CPU unused #32770

Open
