Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Normalize transaction values within the engine #10648

Merged
merged 1 commit into from
Aug 25, 2021

Conversation

nickchapman-da
Copy link
Contributor

@nickchapman-da nickchapman-da commented Aug 23, 2021

Normalize transaction values in the engine as they are converted from speedy-values.

  • Except in some limited cases! (scenario-runner, sandbox, some tests)
  • isReplayedBy now finally becomes structural equality!
  • Normalization controlled by a new EngineConfig flag (transactionNormalization)
  • which is passed when a speedy Machine is constructed

This PR advances #6665

@nickchapman-da nickchapman-da added the component/daml-engine DAML-LF Engine & Interpreter label Aug 23, 2021
@nickchapman-da nickchapman-da marked this pull request as ready for review August 23, 2021 14:44
Copy link
Collaborator

@remyhaemmerle-da remyhaemmerle-da left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I completely support the objective to replace irreplaceable by the structural equality.

This makes the jobs, however I not very fan of the current state:

  • The API of PartialTransaction now mixes Value and SValue
  • The version is inferred several times and in different places (In SBuiltin and in PartialTransaction). I would have preferred it to be done in one or the other.

Copy link
Contributor

@cocreature cocreature left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Mostly looks good but I’d like to understand how we handle byKey and if we really need to make normalization optional before approving.

Copy link
Contributor

@cocreature cocreature left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Nice, thanks!
Can you please post the list of sandbox test cases that fail if we enable normalization there before merging?

@nickchapman-da
Copy link
Contributor Author

Nice, thanks!
Can you please post the list of sandbox test cases that fail if we enable normalization there before merging?

will do

Copy link
Collaborator

@remyhaemmerle-da remyhaemmerle-da left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I like it more like that. Thanks a lot.

@nickchapman-da
Copy link
Contributor Author

If we remove transactionNormalization = false,
from ledger/sandbox-classic/src/main/scala/platform/sandbox/SandboxServer.scala
then the following tests fail:

//compiler/damlc/tests:repl
//language-support/ts/codegen/tests:build-and-lint-test
//ledger-service/http-json:integration-tests-ce
//ledger-service/http-json:integration-tests-ee
//ledger/sandbox-classic:conformance-test-exceptions-memory-1.14
//ledger/sandbox-classic:conformance-test-static-time-memory-1.14
//ledger/sandbox-classic:conformance-test-wall-clock-memory-1.14

@cocreature
Copy link
Contributor

That’s not good, it’s breaking verbose mode on the ledger API not just internal tests. I’m fine with disabling this in this PR but this really should go through the value enricher.

Create normalized TXs when a partial TX is finalised.

Except in limited cases! (i.e for scenario-runner, sandbox)

CHANGELOG_BEGIN
CHANGELOG_END

normalize values in the engine as they are converted from speedy-values

fix 2.12 build

backout redundant change

ensure byKey field is correctly normalized when constructed by engine

rename flag: valueNormalization -> transactionNormalization

improve comment

delete commented-out code

rename: toValueNorm --> toNormalizedValue

rename: (SValue.) toValue --> toUnNormalizedValue

revert changes to ptx so that the interface to insertCreate() etc is Value-based (not SValue-based)

improve comments

respell: toUnNormalizedValue --> toUnnormalizedValue

fix build
@nickchapman-da nickchapman-da force-pushed the nick-norm-values-within-engine branch from d393f91 to ee86ba4 Compare August 25, 2021 06:51
@cocreature
Copy link
Contributor

Looking at the code, the enricher is missing from

. Do you want to take a look at fixing this after this PR @nickchapman-da?

@nickchapman-da nickchapman-da merged commit 387c68b into main Aug 25, 2021
@nickchapman-da nickchapman-da deleted the nick-norm-values-within-engine branch August 25, 2021 08:53
@nickchapman-da
Copy link
Contributor Author

Looking at the code, the enricher is missing from

. Do you want to take a look at fixing this after this PR @nickchapman-da?

Sure

azure-pipelines bot pushed a commit that referenced this pull request Sep 1, 2021
This PR has been created by a script, which is not very smart
and does not have all the context. Please do double-check that
the version prefix is correct before merging.

@aherrmann-da is in charge of this release.

Commit log:
```
f058c2f DPP-368 clean up flags (#10711)
90ad24f  [Divulgence pruning] Prune immediate divulgence [DPP-513] (#10691)
183934b Command dedup: add columns to completions in append-only schemas [KVL-1057] (#10652)
27c1333 LF: Drop outdated TODOs (#10725)
9be577a Enable pruning in the sandbox-classic when the append-only schema is used [DPP-567] (#10708)
9f072ae Ledger-API Conformance test for Contract ID V0 (#10717)
867547c DPP-368 enable append-only flag in sandbox (#10710)
bdc511e [DPP-438] Change open-ended metric names into static ones (by removing partyName part) (#10706)
0c82006 [Divulgence pruning] Prune all divulged contracts only after migration offset [DPP-483] (#10661)
2555dbb Use soft references for values in the caches (#10715)
7fd5906 Add LedgerClientWithoutLedgerId next to the LedgerClient (#10681)
856c69c participant-integration-api: Increase a test timeout. (#10721)
b86d07d remove non functioning oracle json array indices (#10720)
f5e1756 sandbox-classic: Remove default parameters in `SqlLedger.Owner`. (#10718)
9ef3377 LF: Update specification with Contract ID Comparability check (#10703)
e5c4734 enable JSON search index on Oracle by default (#10539)
1ded42f [DPP-418] Protect TLS keys - follow-up cleanup (#10696)
3fcd986 Introduce a new `Offset` format [KVL-1063] (#10668)
a5781a6 update NOTICES file (#10714)
8985505 participant-integration-api: Use deadlines, not deduplication times, for expiring trackers. [KVL-1009] (#10704)
65025c2 sandbox-classic: Add ErrorInfo metadata for rejections. [KVL-1048] (#10707)
97bda3c LF: V1 Contract ID check in Preprocessor (#10687)
c2f90ef Add CLI option to force disabling of participant deduplication (#10698)
6016633 Construct ParsedModule directly in Daml Repl (#10701)
bbdf16a DPP-368 unhide append-only CLI flags (#10697)
a41b134 Use the tracker retention period as the maximum expiry time. [KVL-1009] (#10700)
e750ba5 Make warning less scary. (#10699)
5f120bd rotate release duty after 1.17.0-snapshot.20210824.7647.0.640fb683 (#10660)
a17253f DPP-535 Verify postgres version (#10577)
301ce53 participant-integration-api: Add tests for ApiCommandService. [KVL-1009] (#10689)
bd01a21 [DPP-418] Protect Participant TLS keys (#10629)
7ee1324 update NOTICES file (#10695)
7c392f3 update NOTICES file (#10693)
3db654e update NOTICES file (#10690)
eff09c0 ledger-api-client: Wrap command submissions in a new class. [KVL-1009] (#10683)
d54adb2  Ledger-API: Conformance tests for contract IDs suffixing (#10654)
aa2e869 [Divulgence pruning] Pass divulged contract arguments through kvutils Write/ReadService [DPP-535] (#10598)
1a78313 Disable DeeplyNestedValueIT suite against canton in Daml repo (#10686)
b5f9be3 participant-integration-api: Standardize tracker retention naming. (#10682)
2aa632e ledger-on-sql: Do not increment the dispatcher head on start. (#10684)
eabb19d [ledger-api] Add deduplication_duration to deduplication period [kvl-1047] (#10676)
96ad9b5 [Divulgence pruning] All divulgence events pruning [DPP-483] (#10634)
0b7980d Update rules_haskell (#10674)
284edfc Fix FlywayMigrations datasource (#10666)
adbe65f Document ActionFail vs CanAbort (#10657)
52e7a6d update compat versions for 1.17.0-snapshot.20210824.7647.0.640fb683 (#10667)
f42e6b6 Expose pending contracts in triggers (#10672)
7cc6989 Add multiple ways of specifying deduplication [KVL-1047] (#10601)
53be19f participant-integration-api: Ensure that all waiting, failed, and closed trackers are cleaned up. (#10662)
b27cde6 participant-integration-api: Move tracker code around, and tidy up tests. (#10663)
fc9d359 Drop alternative rules from dlint config (#10646)
5204d3c Include committers in PartialTransaction root context (#10665)
387c68b Normalize transaction values within the engine (#10648)
430c1cc release 1.17.0-snapshot.20210824.7647.0.640fb683 (#10659)
ef239fd participant-integration-api: Move `TrackerMap` code around. [KVL-1009] (#10653)
```
Changelog:
```
- [Sandbox] - Participant pruning is enabled in the sandbox-classic when the append-only schema is used

- [Ledger Client Scala Bindings] A new variant of the LedgerClient class
    was added called `LedgerClientWithoutLedgerId`. This class does not
    need a ledger id at initialization. It was added to allow skipping
    any checks at initialization for use cases where either the
    ledger id is not known at initalization or no valid token can be fed
    at initialization for checking the ledger id. Furthermore for each
    classes `ActiveContractSetClient`, `CommandClient`, `PackageClient`,
    `TransactionClient`, `VersionClient` now exists a variant which
    doesn't depend on a ledger id at initialization and instead requires
    one for every function as parameter. Moreover the existing classes
    are extending these classes with overriding the methods and setting
    the default of the parameter with the given ledger id from
    initialization. The class `LedgerClientWithoutLedgerId` already
    makes usage of these variants e.g. `PackageClientWithoutLedgerId`.

- [Ledger Client Scala Bindings] The function `transactionSource` of the
    class `LedgerClientBinding` now optionally accepts a token which is
    passed on to the unterlying call.

- [JSON API] The Oracle database schema has changed; if using
  ``--query-store-jdbc-config``, you must rebuild the database by adding
  ``,start-mode=create-only``.  See #10539.
- [Trigger Service] ``--help`` no longer advertises unsupported JDBC
  options from JSON API.
- [JSON API] [EE only] By default, on Oracle, sets up a JSON search
  index to speed up the queries endpoints.  However, Oracle versions
  prior to 19.12 have an unrecoverably buggy implementation of this
  index; in addition, the current implementation fails on queries with
  strings >256 bytes, with no way to disable the index for that query.
  Pass the ``disableContractPayloadIndexing=true`` option as part of
  ``--query-store-jdbc-config`` to disable this index when creating the
  schema.
  See `issue #10539 <https://github.com/digital-asset/daml/pull/10539>`__.

- [Integration Kit] Changes the Offset format to contain a version and therefore reduces the highest index size by one byte

- [Ledger API Server] The command deduplication time is no longer used
  for determining the period of time to track the command before giving
  up. Instead, the gRPC deadline is used. If no deadline is provided
  (or if the deadline exceeds the command tracker retention period), the
   tracker retention period is used instead.

- [Daml Repl] Fix a bug where bindings with out of scope types would result in error in following lines.

- [Sandbox, participant] Added a flag to enable a new append-only database schema.
  This schema was designed to support significantly higher performance.
  In a future release, all applications will automatically migrate to the new schema.
- [Ledger API Server] The command service now uses the tracker retention
  period (typically specified with the ``--tracker-retention-period``
  command-line argument) as the maximum time to wait for a command to
  arrive on the completion stream. After this time, the command will
  time out, though it may still complete in the future. Previously, the
  deduplication period was used, but it was likely the tracker would be
  terminated before that anyway.

  The default tracker retention period is 5 minutes, unless otherwise
  specified.
- [DPP-418] [Participant] Add support for supplying server's private key as an encrypted file and then decrypting it with the help of a secrets server.
[Integration Kit] KV-based ledgers pass contract instances through the Write/ReadService, removing the need
for backfilling divulged contract lookups.
Note: KV Ledgers that have been created before this change will still be relying on backfilling lookups of divulged contracts,
hence pruning of all divulged contracts may result in failing lookups for divulged contracts.
ledger-api - add `deduplication_duration` as a future replacement for `deduplication_time` in the command proto definition
ledger-api - Command deduplication period can now be specified by setting `deduplication_offset` instead of `deduplication_time` (only valid for v2 WriteService). This change is backwards compatible.
```

CHANGELOG_BEGIN
CHANGELOG_END
aherrmann-da pushed a commit that referenced this pull request Sep 1, 2021
This PR has been created by a script, which is not very smart
and does not have all the context. Please do double-check that
the version prefix is correct before merging.

@aherrmann-da is in charge of this release.

Commit log:
```
f058c2f DPP-368 clean up flags (#10711)
90ad24f  [Divulgence pruning] Prune immediate divulgence [DPP-513] (#10691)
183934b Command dedup: add columns to completions in append-only schemas [KVL-1057] (#10652)
27c1333 LF: Drop outdated TODOs (#10725)
9be577a Enable pruning in the sandbox-classic when the append-only schema is used [DPP-567] (#10708)
9f072ae Ledger-API Conformance test for Contract ID V0 (#10717)
867547c DPP-368 enable append-only flag in sandbox (#10710)
bdc511e [DPP-438] Change open-ended metric names into static ones (by removing partyName part) (#10706)
0c82006 [Divulgence pruning] Prune all divulged contracts only after migration offset [DPP-483] (#10661)
2555dbb Use soft references for values in the caches (#10715)
7fd5906 Add LedgerClientWithoutLedgerId next to the LedgerClient (#10681)
856c69c participant-integration-api: Increase a test timeout. (#10721)
b86d07d remove non functioning oracle json array indices (#10720)
f5e1756 sandbox-classic: Remove default parameters in `SqlLedger.Owner`. (#10718)
9ef3377 LF: Update specification with Contract ID Comparability check (#10703)
e5c4734 enable JSON search index on Oracle by default (#10539)
1ded42f [DPP-418] Protect TLS keys - follow-up cleanup (#10696)
3fcd986 Introduce a new `Offset` format [KVL-1063] (#10668)
a5781a6 update NOTICES file (#10714)
8985505 participant-integration-api: Use deadlines, not deduplication times, for expiring trackers. [KVL-1009] (#10704)
65025c2 sandbox-classic: Add ErrorInfo metadata for rejections. [KVL-1048] (#10707)
97bda3c LF: V1 Contract ID check in Preprocessor (#10687)
c2f90ef Add CLI option to force disabling of participant deduplication (#10698)
6016633 Construct ParsedModule directly in Daml Repl (#10701)
bbdf16a DPP-368 unhide append-only CLI flags (#10697)
a41b134 Use the tracker retention period as the maximum expiry time. [KVL-1009] (#10700)
e750ba5 Make warning less scary. (#10699)
5f120bd rotate release duty after 1.17.0-snapshot.20210824.7647.0.640fb683 (#10660)
a17253f DPP-535 Verify postgres version (#10577)
301ce53 participant-integration-api: Add tests for ApiCommandService. [KVL-1009] (#10689)
bd01a21 [DPP-418] Protect Participant TLS keys (#10629)
7ee1324 update NOTICES file (#10695)
7c392f3 update NOTICES file (#10693)
3db654e update NOTICES file (#10690)
eff09c0 ledger-api-client: Wrap command submissions in a new class. [KVL-1009] (#10683)
d54adb2  Ledger-API: Conformance tests for contract IDs suffixing (#10654)
aa2e869 [Divulgence pruning] Pass divulged contract arguments through kvutils Write/ReadService [DPP-535] (#10598)
1a78313 Disable DeeplyNestedValueIT suite against canton in Daml repo (#10686)
b5f9be3 participant-integration-api: Standardize tracker retention naming. (#10682)
2aa632e ledger-on-sql: Do not increment the dispatcher head on start. (#10684)
eabb19d [ledger-api] Add deduplication_duration to deduplication period [kvl-1047] (#10676)
96ad9b5 [Divulgence pruning] All divulgence events pruning [DPP-483] (#10634)
0b7980d Update rules_haskell (#10674)
284edfc Fix FlywayMigrations datasource (#10666)
adbe65f Document ActionFail vs CanAbort (#10657)
52e7a6d update compat versions for 1.17.0-snapshot.20210824.7647.0.640fb683 (#10667)
f42e6b6 Expose pending contracts in triggers (#10672)
7cc6989 Add multiple ways of specifying deduplication [KVL-1047] (#10601)
53be19f participant-integration-api: Ensure that all waiting, failed, and closed trackers are cleaned up. (#10662)
b27cde6 participant-integration-api: Move tracker code around, and tidy up tests. (#10663)
fc9d359 Drop alternative rules from dlint config (#10646)
5204d3c Include committers in PartialTransaction root context (#10665)
387c68b Normalize transaction values within the engine (#10648)
430c1cc release 1.17.0-snapshot.20210824.7647.0.640fb683 (#10659)
ef239fd participant-integration-api: Move `TrackerMap` code around. [KVL-1009] (#10653)
```
Changelog:
```
- [Sandbox] - Participant pruning is enabled in the sandbox-classic when the append-only schema is used

- [Ledger Client Scala Bindings] A new variant of the LedgerClient class
    was added called `LedgerClientWithoutLedgerId`. This class does not
    need a ledger id at initialization. It was added to allow skipping
    any checks at initialization for use cases where either the
    ledger id is not known at initalization or no valid token can be fed
    at initialization for checking the ledger id. Furthermore for each
    classes `ActiveContractSetClient`, `CommandClient`, `PackageClient`,
    `TransactionClient`, `VersionClient` now exists a variant which
    doesn't depend on a ledger id at initialization and instead requires
    one for every function as parameter. Moreover the existing classes
    are extending these classes with overriding the methods and setting
    the default of the parameter with the given ledger id from
    initialization. The class `LedgerClientWithoutLedgerId` already
    makes usage of these variants e.g. `PackageClientWithoutLedgerId`.

- [Ledger Client Scala Bindings] The function `transactionSource` of the
    class `LedgerClientBinding` now optionally accepts a token which is
    passed on to the unterlying call.

- [JSON API] The Oracle database schema has changed; if using
  ``--query-store-jdbc-config``, you must rebuild the database by adding
  ``,start-mode=create-only``.  See #10539.
- [Trigger Service] ``--help`` no longer advertises unsupported JDBC
  options from JSON API.
- [JSON API] [EE only] By default, on Oracle, sets up a JSON search
  index to speed up the queries endpoints.  However, Oracle versions
  prior to 19.12 have an unrecoverably buggy implementation of this
  index; in addition, the current implementation fails on queries with
  strings >256 bytes, with no way to disable the index for that query.
  Pass the ``disableContractPayloadIndexing=true`` option as part of
  ``--query-store-jdbc-config`` to disable this index when creating the
  schema.
  See `issue #10539 <https://github.com/digital-asset/daml/pull/10539>`__.

- [Integration Kit] Changes the Offset format to contain a version and therefore reduces the highest index size by one byte

- [Ledger API Server] The command deduplication time is no longer used
  for determining the period of time to track the command before giving
  up. Instead, the gRPC deadline is used. If no deadline is provided
  (or if the deadline exceeds the command tracker retention period), the
   tracker retention period is used instead.

- [Daml Repl] Fix a bug where bindings with out of scope types would result in error in following lines.

- [Sandbox, participant] Added a flag to enable a new append-only database schema.
  This schema was designed to support significantly higher performance.
  In a future release, all applications will automatically migrate to the new schema.
- [Ledger API Server] The command service now uses the tracker retention
  period (typically specified with the ``--tracker-retention-period``
  command-line argument) as the maximum time to wait for a command to
  arrive on the completion stream. After this time, the command will
  time out, though it may still complete in the future. Previously, the
  deduplication period was used, but it was likely the tracker would be
  terminated before that anyway.

  The default tracker retention period is 5 minutes, unless otherwise
  specified.
- [DPP-418] [Participant] Add support for supplying server's private key as an encrypted file and then decrypting it with the help of a secrets server.
[Integration Kit] KV-based ledgers pass contract instances through the Write/ReadService, removing the need
for backfilling divulged contract lookups.
Note: KV Ledgers that have been created before this change will still be relying on backfilling lookups of divulged contracts,
hence pruning of all divulged contracts may result in failing lookups for divulged contracts.
ledger-api - add `deduplication_duration` as a future replacement for `deduplication_time` in the command proto definition
ledger-api - Command deduplication period can now be specified by setting `deduplication_offset` instead of `deduplication_time` (only valid for v2 WriteService). This change is backwards compatible.
```

CHANGELOG_BEGIN
CHANGELOG_END

Co-authored-by: Azure Pipelines DAML Build <support@digitalasset.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
component/daml-engine DAML-LF Engine & Interpreter
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants