Improvements to UTxO-HD: mempool snapshotting #1382

jasagredo · 2025-02-06T15:39:22Z

This PR accumulates some improvements for UTxO-HD that result in benefits for mempool snapshotting, which is part of the critical path for block minting (and therefore block diffusion). Running in a full-block synthetic chain shows these times for 50 consecutive mempool snapshots:

pre: at utxo-hd-main before this PR
caf: pre + moving the Cardano tx out translations to a CAF
no-diffs: caf + do not compute diffs when snapshotting the mempool
no-shtxin: no-diffs + remove the ShelleyTxIn newtype

nfrisby · 2025-02-07T17:04:07Z

I edited the PR description a little.

nfrisby

Mostly comments, fewer bangs, etc.

nfrisby · 2025-02-07T17:02:45Z

ouroboros-consensus/src/ouroboros-consensus/Ouroboros/Consensus/HardFork/Combinator/Ledger.hs

@@ -1114,6 +1116,14 @@ class ( Show (HardForkTxOut xs)
  default txOutEjections :: CanHardFork xs => NP (K (NS WrapTxOut xs) -.-> WrapTxOut) xs
  txOutEjections = composeTxOutTranslations $ ipTranslateTxOut hardForkEraTranslation

+  txOutTails :: Tails (InPairs.Fn2 WrapTxOut) xs


Need a comment on this method, like for txOutEjections just above it.

It's a little awkward that we never intend for any instance to override this default 🤔.

I suppose the only alternative would be to define the polymorphic "CAF" and give it a SPECIALISE pragma. But... then we'd have to make sure there's a SPECIALISE chain that reaches from the Consensus entry point all the way to every use, which is a real headache. So, the awkwardness seems worthwhile 👍

With the current approach, the dictionary apparently handles that specialization chain for us. But I'm not exactly sure why it's able to do so, which makes me a little nervous. We have instance CardanoHardForkConstraints c => HasHardForkTxOut (CardanoEras c), which means that instance function (ie the thing that builds the HasHardForkTxOut dictionary from the other dictionaries) might get used in some code location that needs HasHardForkTxOut and already has a CardanoHardForkConstraints context. Since the txOutTails default does depend on that instance context (since it has CanHardFork in its own context, and our Cardano instance of that also requires CardanoHardForkConstraints), it wouldn't be shared among different uses of this instance function.

But, your measurements show that it does help. I just worry that we might accidentally spoil that someday, since it's not entirely transparent to us why tucking this polymorphic "CAF" inside a type class does actually end up using a proper CAF.

(This same concern applies to txOutEjections --- I just hadn't thought through the details when originally suggesting it.)

Edit: I guess as of the Ledger Team's monomorphic crypto work, the explanation will become much simpler, since CardanoHardForkConstraints only has one index, c! At that point, it'll be obvious that a sufficient requirement is for CanHardFork XS to be in-scope for the HasHardForkTxOut XS instance.

nfrisby · 2025-02-07T17:30:03Z

ouroboros-consensus/src/ouroboros-consensus/Ouroboros/Consensus/Ledger/SupportsMempool.hs

+--
+-- When getting a mempool snapshot, we will revalidate all the
+-- transactions but we won't do anything useful with the resulting
+-- state. We can safely omit computing the differences in this case.


Please amend this comment to also discuss:

This is a worthwhile optimization (and has been measured as such), since snapshotting the mempool is on the critical path of block minting.

Eventually, the UTxO HD plan has always been for the ledger rules to construct the differences, instead of the Consensus layer computing them retroactively via calculateDifferences. Once that's true, we hope this optimization will no longer be worthwhile. That's part of the reason that we're content to just use a boolean isomorph instead and and yield empty Diffs instead of making it a GADT that makes the precise codomain (or just having two different functions, one with Diffs in the codomain and one without).

nfrisby · 2025-02-07T17:32:40Z

ouroboros-consensus/src/ouroboros-consensus/Ouroboros/Consensus/Ledger/SupportsMempool.hs

+-- transactions but we won't do anything useful with the resulting
+-- state. We can safely omit computing the differences in this case.
+data ComputeDiffs
+  = ComputeDiffs


Comment saying that this option should be used with resyncing the mempool with an updated selection.

nfrisby · 2025-02-07T17:33:04Z

ouroboros-consensus/src/ouroboros-consensus/Ouroboros/Consensus/Ledger/SupportsMempool.hs

+-- state. We can safely omit computing the differences in this case.
+data ComputeDiffs
+  = ComputeDiffs
+  | IgnoreDiffs


Comment saying that this option should be used when snapshotting the mempool when minting a block.

nfrisby · 2025-02-07T17:34:26Z

ouroboros-consensus/src/ouroboros-consensus/Ouroboros/Consensus/Ledger/Tables/Utils.hs

+
+---
+
+unionValues ::


My intuition is that we should have a comment here explaining that collisions are impossible for this first phase of UTxO HD.

nfrisby · 2025-02-07T17:34:42Z

ouroboros-consensus/src/ouroboros-consensus/Ouroboros/Consensus/Mempool/Impl/Common.hs

@@ -82,6 +83,9 @@ data InternalState blk = IS {
      -- This should always be in-sync with the transactions in 'isTxs'.
    , isTxIds        :: !(Set (GenTxId blk))

+    , isTxKeys       :: !(LedgerTables (LedgerState blk) KeysMK)


These two fields definitely need comments, which can probably state a pretty simple INVARIANT with respect to isTxs. I'd imagine it can be exact for isTxKeys and just a little hand-wavy for isTxValues.

nfrisby · 2025-02-07T17:38:30Z

ouroboros-consensus/src/ouroboros-consensus/Ouroboros/Consensus/Mempool/Impl/Common.hs

+  in snapshotFromIS $ IS {
+         isTxs          = TxSeq.fromList $ map unwrap val
+       , isTxIds        = Set.fromList $ map (txId . txForgetValidated . fst) val
+       , isTxKeys       = emptyLedgerTables


I think a comment could help explain why isTxKeys and isTxValues can soundly be empty here.

nfrisby · 2025-02-07T17:40:54Z

ouroboros-consensus/src/ouroboros-consensus/Ouroboros/Consensus/Mempool/Query.hs

-                values
-                (isLastTicketNo is)
-                (TxSeq.toList $ isTxs is)
+    else if pointHash (isTip is) == castHash (getTipHash ticked)


I think there should be one call to computeSnapshot, where the values has been conditionally recovered either from isTxValues or from isTxKeys.

nfrisby · 2025-02-07T17:54:35Z

ouroboros-consensus/src/ouroboros-consensus/Ouroboros/Consensus/Ledger/Tables.hs

+     -> Map.Map (TxIn l) (TxOut l)
+     -> CBOR.Decoder s (Map.Map (TxIn l) (TxOut l))
+  go 0 m = pure m
+  go !len m = do


I think the only place you need bangs are len and m, yeah? Since you're using the Data.Map.Strict interface. (Which would plug the leak in m here, also.)

amesgen and others added 4 commits February 7, 2025 10:55

db-analyser: support V2 LedgerDB

de7bd0c

Promote cardano translations to a CAF

ac00dba

Do not compute diffs in mempool snapshot

2707cee

Remove ShelleyTxIn

dc531bd

jasagredo force-pushed the js/tails-caf branch from b86b771 to dc531bd Compare February 7, 2025 09:56

jasagredo added 2 commits February 7, 2025 12:23

Fixup tests, format code

a4a3fa7

Don't accumulate thunks in deserialization of snapshots

10bc7d1

jasagredo changed the title ~~WIP: improvements to UTxO-HD~~ Improvements to UTxO-HD: mempool snapshotting Feb 7, 2025

jasagredo self-assigned this Feb 7, 2025

jasagredo added component-mempool UTxO-HD labels Feb 7, 2025

jasagredo marked this pull request as ready for review February 7, 2025 12:11

jasagredo requested review from nfrisby, amesgen, fraser-iohk, dnadales and geo2a as code owners February 7, 2025 12:11

nfrisby mentioned this pull request Feb 7, 2025

Move a couple of mempool operations to work on ledgerstates instead #1384

Open

nfrisby requested changes Feb 7, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improvements to UTxO-HD: mempool snapshotting #1382

Improvements to UTxO-HD: mempool snapshotting #1382

jasagredo commented Feb 6, 2025 •

edited by nfrisby

Loading

nfrisby commented Feb 7, 2025

nfrisby left a comment

nfrisby Feb 7, 2025

nfrisby Feb 7, 2025

nfrisby Feb 7, 2025

nfrisby Feb 7, 2025

nfrisby Feb 7, 2025

nfrisby Feb 7, 2025

nfrisby Feb 7, 2025

nfrisby Feb 7, 2025

nfrisby Feb 7, 2025

nfrisby Feb 7, 2025


		---

		unionValues ::

Improvements to UTxO-HD: mempool snapshotting #1382

Are you sure you want to change the base?

Improvements to UTxO-HD: mempool snapshotting #1382

Conversation

jasagredo commented Feb 6, 2025 • edited by nfrisby Loading

nfrisby commented Feb 7, 2025

nfrisby left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jasagredo commented Feb 6, 2025 •

edited by nfrisby

Loading