Remove double serialization; use software encoder for fee estimation #21966

sipa · 2021-05-17T01:38:46Z

Based on #21981.

This adds a software-based platform-independent float/double encoder/decoder (platform independent in the sense that it only uses arithmetic and library calls, but never inspects the binary representation). This should strengthen our guarantee that encoded float/double values are portable across platforms. It then removes the functionality to serialize doubles from serialize.h, and replaces its only (non-test) use for fee estimation data serialization with the software encoder.

At least on x86/ARM, the only difference should be how certain NaN values are encoded/decoded (but not whether they are NaN or not).

It comes with tests that verify on is_iec559 platforms (which are the only ones we support, at least for now) that the serialized bytes exactly match the binary representation of floats in memory (for non-NaN).

DrahtBot · 2021-05-17T03:43:56Z

The following sections might be updated with supplementary metadata relevant to reviewers and maintainers.

Conflicts

Reviewers, this pull request conflicts with the following ones:

Remove unused float serialization #21981 (Remove unused float serialization by MarcoFalke)
refactor: Switch serialize to uint8_t (Bundle 1/2) #21969 (refactor: Switch serialize to uint8_t (Bundle 1/2) by MarcoFalke)
Add fee_est tool for debugging fee estimation code #10443 (Add fee_est tool for debugging fee estimation code by ryanofsky)

If you consider this pull request important, please also help to review the conflicting pull requests. Ideally, start with the one that should be merged first.

practicalswift · 2021-05-17T09:40:30Z

Strong Concept ACK

laanwj · 2021-05-17T10:23:59Z

While I think this is intriguing technically, I'm ~0 on this conceptually

I think using floating point in places where 100% precision or portability across platforms is important is mistaken in the first place. This gives the wrong impression.

It raises deeper questions for me: do we really need floating point in Bitcoin at all? Narrower: do we really need to serialize/deserialize it?
(clearly it's already not used for anything consensus critical)

maflcko · 2021-05-17T10:28:14Z

Haven't checked, but I presume it is only used for fees.dat?

laanwj · 2021-05-17T10:42:10Z

If so, I'd prefer to find an alternative way of serializing those values specifically (as fixed-point or numerator / denominator pairs of integers) and dropping the general float/double (de)serialization.

practicalswift · 2021-05-17T14:08:51Z

Narrower: do we really need to serialize/deserialize it?

As MarcoFalke discovered ser_float_to_uint32 and ser_uint32_to_float are currently unused (#21981), and AFAICT TxConfirmStats::Write and TxConfirmStats::Read are the only remaining users of ser_double_to_uint64 and ser_uint64_to_double.

Looks promising! :)

sipa · 2021-05-17T18:10:28Z

@laanwj That"s fair. I agree that avoiding serialization of floating point values directly is much more desirable.

My thinking here is that it effectively gives us a well-specified serialization with testable properties without needing to break compatibility with existing files.

How about rebasing this PR on top of #21981, and making it remove serialization support for double too in serialization.h, and instead make the feedata writing/reading code invoke EncodeDouble/DecodeDouble directly?

laanwj · 2021-05-18T07:39:18Z

How about rebasing this PR on top of #21981, and making it remove serialization support for double too in serialization.h, and instead make the feedata writing/reading code invoke EncodeDouble/DecodeDouble directly?

I was about to comment this! Let's make this code private to the single case where it is used? It keeps the current format but also prevents future serialization of these types 😄

sipa · 2021-05-18T19:55:40Z

@laanwj Done.

practicalswift · 2021-05-18T22:06:33Z

cr ACK 892522d: patch looks correct :)

Very happy to see ser_double_to_uint64/ser_uint64_to_double go and src/compat/assumptions.h shrink :)

laanwj · 2021-05-19T08:23:35Z

Thanks! Happy to see this now.

~~Code review ACK 892522d~~
Code review re-ACK 66545da

src/test/serfloat_tests.cpp

practicalswift · 2021-05-25T19:08:09Z

src/test/fuzz/float.cpp

-        stream >> f_deserialized;
-        assert(f == f_deserialized);
+        uint64_t encoded = EncodeDouble(d);
+        if constexpr (std::numeric_limits<double>::is_iec559) {


Note to other reviewers: We assume this to hold true in assumptions.h:

static_assert(std::numeric_limits<double>::is_iec559, "IEEE 754 double assumed");

practicalswift · 2021-05-25T19:08:29Z

src/test/serfloat_tests.cpp

+    BOOST_CHECK_EQUAL(TestDouble(785.066650390625), 0x4088888880000000ULL);
+
+    // Roundtrip test on IEC559-compatible systems
+    if (std::numeric_limits<double>::is_iec559) {


Note to other reviewers: We assume this to hold true in assumptions.h:

static_assert(std::numeric_limits<double>::is_iec559, "IEEE 754 double assumed");

I should remove that assumption now, it's no longer needed.

I think we need that assumption as long as we're doing floating-point division by zero? I still think we do that in ConnectBlock, CreateTransaction and EstimateMedianVal :)

Ah, good point.

Maybe a weaker assumption could do there. In any case, it's out of scope for this PR. Happy to leave it as it is for the foreseeable future unless anyone has good reason to work on porting bitcoind to non-IEC559 platforms.

(in which case I heartily suggest: please get rid of all floating point code. it shouldn't be necessary in financial-ish code)

practicalswift · 2021-05-25T19:09:58Z

cr re-ACK 66545da

…r for fee estimation 66545da Remove support for double serialization (Pieter Wuille) fff1cae Convert uses of double-serialization to {En,De}codeDouble (Pieter Wuille) afd964d Convert existing float encoding tests (Pieter Wuille) bda33f9 Add unit tests for serfloat module (Pieter Wuille) 2be4cd9 Add platform-independent float encoder/decoder (Pieter Wuille) e40224d Remove unused float serialization (MarcoFalke) Pull request description: Based on bitcoin#21981. This adds a software-based platform-independent float/double encoder/decoder (platform independent in the sense that it only uses arithmetic and library calls, but never inspects the binary representation). This should strengthen our guarantee that encoded float/double values are portable across platforms. It then removes the functionality to serialize doubles from serialize.h, and replaces its only (non-test) use for fee estimation data serialization with the software encoder. At least on x86/ARM, the only difference should be how certain NaN values are encoded/decoded (but not *whether* they are NaN or not). It comes with tests that verify on is_iec559 platforms (which are the only ones we support, at least for now) that the serialized bytes exactly match the binary representation of floats in memory (for non-NaN). ACKs for top commit: laanwj: Code review re-ACK 66545da practicalswift: cr re-ACK 66545da Tree-SHA512: 62ad9adc26e28707b2eb12a919feefd4fd10cf9032652dbb1ca1cc97638ac21de89e240858e80d293d5112685c623e58affa3d316a9783ff0e6d291977a141f5

maflcko

review ACK

partial bitcoin#15638, bitcoin#21966, bitcoin#16889, merge bitcoin#14555, bitcoin#20499, bitcoin#14074, bitcoin#17073: util refactoring

sipa force-pushed the 202105_softfloat branch 2 times, most recently from f4232db to 8b0b06c Compare May 17, 2021 02:02

DrahtBot added Build system Utils/log/libs labels May 17, 2021

sipa removed the Build system label May 17, 2021

sipa force-pushed the 202105_softfloat branch from 8b0b06c to 45add0c Compare May 17, 2021 02:57

DrahtBot mentioned this pull request May 17, 2021

Document that ser_float_to_uint32 is not the inverse of ser_uint32_to_float #21964

Closed

DrahtBot mentioned this pull request May 17, 2021

Remove unused float serialization #21981

Closed

sipa force-pushed the 202105_softfloat branch from 45add0c to 268fe01 Compare May 18, 2021 19:47

sipa changed the title ~~Software float encoding~~ Remove double serialization; use software encoder for fee estimation May 18, 2021

sipa force-pushed the 202105_softfloat branch from 268fe01 to 892522d Compare May 18, 2021 19:54

DrahtBot mentioned this pull request May 19, 2021

refactor: Switch serialize to uint8_t (Bundle 1/2) #21969

Merged

maflcko reviewed May 20, 2021

View reviewed changes

src/test/serfloat_tests.cpp Show resolved Hide resolved

maflcko reviewed May 20, 2021

View reviewed changes

src/test/serfloat_tests.cpp Outdated Show resolved Hide resolved

MarcoFalke and others added 5 commits May 24, 2021 16:04

Remove unused float serialization

e40224d

Add platform-independent float encoder/decoder

2be4cd9

Add unit tests for serfloat module

bda33f9

Convert existing float encoding tests

afd964d

Convert uses of double-serialization to {En,De}codeDouble

fff1cae

Remove support for double serialization

66545da

sipa force-pushed the 202105_softfloat branch from 892522d to 66545da Compare May 24, 2021 23:15

practicalswift reviewed May 25, 2021

View reviewed changes

laanwj merged commit 707ba86 into bitcoin:master May 26, 2021

DrahtBot mentioned this pull request May 26, 2021

Add fee_est tool for debugging fee estimation code #10443

Closed

maflcko reviewed Jun 7, 2021

View reviewed changes

kwvg pushed a commit to kwvg/dash that referenced this pull request Jun 16, 2021

partial bitcoin#21966: Add platform-independent float encoder/decoder

15c67f4

kwvg added a commit to kwvg/dash that referenced this pull request Jun 16, 2021

partial bitcoin#21966: Add platform-independent float encoder/decoder

c8cd8bf

kwvg mentioned this pull request Jun 16, 2021

partial bitcoin#15638, #21966, #16889, merge #14555, #20499, #14074, #17073: util refactoring dashpay/dash#4197

Merged

kwvg added a commit to kwvg/dash that referenced this pull request Jun 16, 2021

partial bitcoin#21966: Add platform-independent float encoder/decoder

cf8d5a9

kwvg added a commit to kwvg/dash that referenced this pull request Jun 16, 2021

partial bitcoin#21966: Add platform-independent float encoder/decoder

43f1909

kwvg added a commit to kwvg/dash that referenced this pull request Jun 24, 2021

partial bitcoin#21966: Add platform-independent float encoder/decoder

f42dcbb

kwvg added a commit to kwvg/dash that referenced this pull request Jun 25, 2021

partial bitcoin#21966: Add platform-independent float encoder/decoder

d129516

kwvg added a commit to kwvg/dash that referenced this pull request Jun 25, 2021

partial bitcoin#21966: Add platform-independent float encoder/decoder

e8d0970

kwvg added a commit to kwvg/dash that referenced this pull request Jun 26, 2021

partial bitcoin#21966: Add platform-independent float encoder/decoder

eecf91c

kwvg added a commit to kwvg/dash that referenced this pull request Jun 26, 2021

partial bitcoin#21966: Add platform-independent float encoder/decoder

441a37f

kwvg added a commit to kwvg/dash that referenced this pull request Jun 27, 2021

partial bitcoin#21966: Add platform-independent float encoder/decoder

7d31914

kwvg added a commit to kwvg/dash that referenced this pull request Jun 27, 2021

partial bitcoin#21966: Add platform-independent float encoder/decoder

f946c68

UdjinM6 added a commit to dashpay/dash that referenced this pull request Jun 27, 2021

Merge pull request #4197 from kittywhiskers/utilRefactor

7d664c7

partial bitcoin#15638, bitcoin#21966, bitcoin#16889, merge bitcoin#14555, bitcoin#20499, bitcoin#14074, bitcoin#17073: util refactoring

gades pushed a commit to cosanta/cosanta-core that referenced this pull request May 1, 2022

partial bitcoin#21966: Add platform-independent float encoder/decoder

4915046

gwillen pushed a commit to ElementsProject/elements that referenced this pull request Jun 1, 2022

Merge 707ba86 into merged_master (Bitcoin PR bitcoin/bitcoin#21966)

95ce7de

bitcoin locked as resolved and limited conversation to collaborators Aug 18, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Remove double serialization; use software encoder for fee estimation #21966

Remove double serialization; use software encoder for fee estimation #21966

sipa commented May 17, 2021 •

edited

Loading

DrahtBot commented May 17, 2021 •

edited

Loading

practicalswift commented May 17, 2021

laanwj commented May 17, 2021 •

edited

Loading

maflcko commented May 17, 2021

laanwj commented May 17, 2021 •

edited

Loading

practicalswift commented May 17, 2021

sipa commented May 17, 2021

laanwj commented May 18, 2021

sipa commented May 18, 2021

practicalswift commented May 18, 2021

laanwj commented May 19, 2021 •

edited

Loading

practicalswift May 25, 2021

practicalswift May 25, 2021

sipa May 25, 2021

practicalswift May 25, 2021

sipa May 25, 2021

laanwj May 26, 2021 •

edited

Loading

practicalswift commented May 25, 2021

maflcko left a comment

Remove double serialization; use software encoder for fee estimation #21966

Remove double serialization; use software encoder for fee estimation #21966

Conversation

sipa commented May 17, 2021 • edited Loading

DrahtBot commented May 17, 2021 • edited Loading

Conflicts

practicalswift commented May 17, 2021

laanwj commented May 17, 2021 • edited Loading

maflcko commented May 17, 2021

laanwj commented May 17, 2021 • edited Loading

practicalswift commented May 17, 2021

sipa commented May 17, 2021

laanwj commented May 18, 2021

sipa commented May 18, 2021

practicalswift commented May 18, 2021

laanwj commented May 19, 2021 • edited Loading

practicalswift May 25, 2021

Choose a reason for hiding this comment

practicalswift May 25, 2021

Choose a reason for hiding this comment

sipa May 25, 2021

Choose a reason for hiding this comment

practicalswift May 25, 2021

Choose a reason for hiding this comment

sipa May 25, 2021

Choose a reason for hiding this comment

laanwj May 26, 2021 • edited Loading

Choose a reason for hiding this comment

practicalswift commented May 25, 2021

maflcko left a comment

Choose a reason for hiding this comment

sipa commented May 17, 2021 •

edited

Loading

DrahtBot commented May 17, 2021 •

edited

Loading

laanwj commented May 17, 2021 •

edited

Loading

laanwj commented May 17, 2021 •

edited

Loading

laanwj commented May 19, 2021 •

edited

Loading

laanwj May 26, 2021 •

edited

Loading