core/state: move state log mechanism to a separate layer #30569

holiman · 2024-10-10T12:44:23Z

In this PR, I have moved the logging-facilities out of *state.StateDB, in to a wrapping struct which implements vm.StateDB instead.

In most places, it was pretty straight-forward.

First, hoisting the invocations from state objects up to the statedb.
Then making the mutation-methods simply return the previous value, so that the external logging layer could log everything.

Some internal code uses the direct object-accessors to mutate the state, particularly in testing and in setting up state overrides, which means that these changes are unobservable for the hooked layer. This is fine, how we configure the overrides are not necessarily part of the API we want to publish.

The trickiest part about the layering is that when the selfdestructs are finally deleted during Finalise, there's the possibility that someone sent some ether to it, which is burnt at that point, and thus needs to be logged. The hooked layer reaches into the inner layer to figure out these events.

In package vm, the conversion from state.StateDB + hooks into a hooked vm.StateDB is performed where needed.

core/blockchain.go

holiman · 2024-10-11T06:42:43Z

After having considered it some more, I am even more convinced that the approach of #30441, adding read-hooks inside the state.StateDB, is bad for more reasons than I thought previously.

It does not discriminate between event sources.

In the state processing, between transactions, we check balance and nonce. This would trigger the read-hooks. However, I'd argue that this is bad: it leaks implementation details and is not useful information. For example: maybe we check all txs nonces in a block first, before we commence executing transactions. Or maybe we verify each transcation (balance + nonce etc) right before execution. And maybe we invoke GetNonce twice, in some place, rather than save it as a temp variable and pass to another callsite. These things are internals, and should not be observable by the tracers. It makes the tracers more complex, having to try to figure out which events are actually useful and which are not.
There's an element of recursiveness: if, in an OnOpCode or step function, the tracer decides to look up an account, it will then trigger a read-hook. Making the call-graph evm interpreter loop -> tracer engine->onOp -> statedb -> tracer engine->OnBalanceRead. Even worse: if the balance read-hook for address X queries the balance for address X+1, then we have a true recursive situation.

The "solution" to these sorts of problems would be to, in certain situations, disable the logger. So while executing the read-hook, the logger would be removed and then added back in defer. I think it becomes pretty messy.

With the layered solution, there are no such complexities, as long as we can switch between the logging-statedb and the non-logging-statedb.

We can simply use a non-logging-statedb while processing the tx envelopes in state processor.
We can simply pass a non-logging-statedb to the tracer engine.

Switching between one and the other can probably be done in many ways, I'm open to suggestions. One way would be to have two interfaces

type LoggingEnabled interface {
    WithLoggingDisabled() vm.StateDB
}
type LoggingDisabled interface {
    WithLoggingEnabled() vm.StateDB
}

Example how that would look, going from a dual-layered logging statedb to a single-layered raw statedb:

	if evm.Config.Tracer != nil && evm.Config.Tracer.OnTxStart != nil {
		ctx := evm.GetVMContext()
		newctx := &(*ctx) // shallow copy
		if sdb, ok := ctx.StateDB.(vm.LoggingEnabled); ok {
			newctx.StateDB = sdb.WithLoggingDisabled()
		}
		evm.Config.Tracer.OnTxStart(newctx, tx, msg.From)
		if evm.Config.Tracer.OnTxEnd != nil {
			defer func() {
				evm.Config.Tracer.OnTxEnd(receipt, err)
			}()
		}
	}

holiman · 2024-10-11T09:33:09Z

Minus the ugliness regarding swapping between shimmed and non-shimmed state, this PR is mostly done. Ideas for how to make the switching nicer are appreciated

core/state/statedb_logger.go

s1na · 2024-10-11T11:11:14Z

In the state processing, between transactions, we check balance and nonce. This would trigger the read-hooks. However, I'd argue that this is bad: it leaks implementation details and is not useful information. For example: maybe we check all txs nonces in a block first, before we commence executing transactions. Or maybe we verify each transcation (balance + nonce etc) right before execution. And maybe we invoke GetNonce twice, in some place, rather than save it as a temp variable and pass to another callsite. These things are internals, and should not be observable by the tracers. It makes the tracers more complex, having to try to figure out which events are actually useful and which are not.

My understanding for the main use-case of the read hooks is to collect the prestate for the transaction/call. So the ordering and how often we emit a OnReadBalance doesn't matter for the tracers. IF that were to matter you are right, then we would need to add a reason to specify what is this read about.

There's an element of recursiveness: if, in an OnOpCode or step function, the tracer decides to look up an account, it will then trigger a read-hook. Making the call-graph evm interpreter loop -> tracer engine->onOp -> statedb -> tracer engine->OnBalanceRead. Even worse: if the balance read-hook for address X queries the balance for address X+1, then we have a true recursive situation.

This was an interesting realization. I think the friction point here is statedb emitting logs for the same methods that are exposed to the tracers via a statedb instance. Honestly I don't like so much that we are exposing statedb to the tracers. We had to do it exactly to fetch prestate values. So IMO if we add read hooks we can drop the statedb. But I'd ask for opinion from users before committing to that.

Generally I am ok with your approach if it allows us to keep the read hooks :)

holiman · 2024-10-11T11:24:51Z

Honestly I don't like so much that we are exposing statedb to the tracers. We had to do it exactly to fetch prestate values. So IMO if we add read hooks we can drop the statedb.

Well, the prestate tracer is perhaps the first driving usecase, but querying state from a tracer is *very* useful and powerful. I would prefer it to remain, definitely! Trying to collect state by catching per-scope readhooks sounds like a nightmare in comparison :)

Generally I am ok with your approach if it allows us to keep the read hooks :)

With my approach as a basis, I won't object to you read-hooking the entire statedb interface.

fjl · 2024-10-11T18:54:49Z

I think if it's possible to remove state read access from tracers, we should do it. Full state access will become impossible later with stateless clients, so it will have to be removed at that time anyway.

namiloh · 2024-10-12T08:56:21Z

Well, I would assume users of somewhat advanced tracing to be using stateful clients.

Anyway, it is a decision unrelated to this PR. It is related to the other PR, since that one doesn't have the ability to switch between logging/nonlogging statedb.

If we want to remove state access, let's someone make a PR and discuss it then, IMO.

@fjl , you are a type artist. Any ideas for making the hacks in this PR neater?

holiman · 2024-10-12T14:24:35Z

@rjl493456442 implemented an alternative state overriding here: #29950. In this implementation, we switch out the backend from underneath the *state.StateDB, so that the Reader which it uses, checks the overrides before reaching into the snapdata or triedata.
Reader interface:

// Reader defines the interface for accessing accounts and storage slots
// associated with a specific state.
type Reader interface {
	// Account retrieves the account associated with a particular address.
	//
	// - Returns a nil account if it does not exist
	// - Returns an error only if an unexpected issue occurs
	// - The returned account is safe to modify after the call
	Account(addr common.Address) (*types.StateAccount, error)

	// Storage retrieves the storage slot associated with a particular account
	// address and slot key.
	//
	// - Returns an empty slot if it does not exist
	// - Returns an error only if an unexpected issue occurs
	// - The returned storage slot is safe to modify after the call
	Storage(addr common.Address, slot common.Hash) (common.Hash, error)

	// Copy returns a deep-copied state reader.
	Copy() Reader
}

But if the the core parts of vm now accesses the statedb through the interface vm.StateDB, instead of directly the struct, and we now have the ability to layer them, then we would have an alternative option to layer the overrides on top of the statedb instead.

So we could have e.g. logging layer <-> override layer <-> regular statedb <-> snapdata/trie .
Or we could have e.g. logging layer <-> regular statedb <-> override layer <-> snapdata/trie .

I am not really sure what are the pros and cons with either approach. @rjl493456442 any thoughts?

core/state/statedb_logger.go

s1na

LGTM

core/state/statedb_logger.go

core/vm/interface.go

fjl · 2024-10-17T12:41:59Z

Notes:

wrap the StateDB deeper into the processing call stack, that way we can avoid the extra methods like IntermediateRoot
pass logger + statedb in ApplyTransactionWithEVM and wrap there
remove SetLogger on StateDB
duplicate vm.StateDB methods in the wrapper to avoid embedding of StateDB. It's more boilerplate but safer for the future.
for the burn tracking of deleted accounts in Finalise, perhaps it would be acceptable to keep Finalise in the vm.StateDB interface. In the wrapper implementation, we could iterate the mutated state and emit a balance change for all deleted accounts that still have a balance.
alternative: pass the burn callback into Finalise as a parameter

core/state/statedb.go

holiman · 2024-10-18T12:29:31Z

Notes:

wrap the StateDB deeper into the processing call stack, that way we can avoid the extra methods like IntermediateRoot
pass logger + statedb in ApplyTransactionWithEVM and wrap there (logger passed implicitly via the evm config)
remove SetLogger on StateDB
duplicate vm.StateDB methods in the wrapper to avoid embedding of StateDB. It's more boilerplate but safer for the future.
for the burn tracking of deleted accounts in Finalise, perhaps it would be acceptable to keep Finalise in the vm.StateDB interface. In the wrapper implementation, we could ~~iterate the mutated state~~ and emit a balance change for all deleted accounts that still have a balance.
- Instead of iterating mutations, I iterate s.inner.journal.dirties. The reason is that the mutations are per-block, not per-tx. The journal dirties are in the correct scope
- ~~alternative: pass the burn callback into Finalise as a parameter~~

…te layer core/state: wip move state log mechanism in to a separate layer core/state: fix tests core/state: fix miscalc in OnBalanceChange eth/tracers: fix tests core/state: re-enable verkle witness generation internal/ethapi: fix simulation + new logging schema

…g burn core/state, core/vm: refactor statedb hooking core/state: trace consensus finalize and system calls eth/tracers/internal/tracetest: fix tests after refactor core/state: some renaming and cleanup of statedb-hooking system core/state: remove unecessary methods, implement hooked subbalance, more testing

rjl493456442 · 2024-10-21T06:51:08Z

core/state_processor.go

@@ -98,7 +99,10 @@ func (p *StateProcessor) Process(block *types.Block, statedb *state.StateDB, cfg
 		receipts = append(receipts, receipt)
 		allLogs = append(allLogs, receipt.Logs...)
 	}
-
+	var tracingStateDB = vm.StateDB(statedb)


Why don't we create a hookedStatedb in the first place, ahead of transaction execution?

Now we create a hooked one for each transaction, it looks wasteful for me, perhaps I miss something?

rjl493456442 · 2024-10-21T06:51:36Z

core/state_processor.go

-			defer func() {
-				evm.Config.Tracer.OnTxEnd(receipt, err)
-			}()
+	var tracingStateDB = vm.StateDB(statedb)


Why don't we allocate a hooked statedb and reuse it for all transactions?

Initially, I created the hooked statedb higher up in the call hierarchy, but the is that at this location, we still need the *state.StateDB, to perform the IntermediateRoot.

The higher we place the instantiation of hooked, we're forced to either cram more methods into vm.StateDB interface (which I did initially), or we need to pass both the *state.StateDB struct and the vm.StateDB interface side by side.

I'll take another look at what to do. Reusing is nice.

So, this method needs the *state.StateDB here, in order to create the receipt before return. We could create it in the caller, and pass both of them... but there are three separate callsites which call in here, and IMO doing this in those three locations is messier than constructing it here.

It's a pretty small object, shouldn't make that much difference IMO.

internal/ethapi/api.go

core/state/statedb_hooked.go

core/state/statedb.go

rjl493456442 · 2024-10-21T07:27:14Z

core/state/statedb_hooked.go

+	return prev
+}
+
+func (s *hookedStateDB) SetNonce(address common.Address, nonce uint64) {


Should we rename this SetNonce to IncreaseNonce? The semantic always expect to increment the nonce by 1

Yes, good idea!

core/state/statedb_hooked.go

holiman · 2024-10-21T08:09:05Z

@rjl493456442 your commit 8c7526c undoes an intentional change:

duplicate vm.StateDB methods in the wrapper to avoid embedding of StateDB. It's more boilerplate but safer for the future.

rjl493456442 · 2024-10-21T10:58:52Z

@holiman I undo my cleanup commit and push various fixes on top

lightclient

This seems like a really nice refactor. Left a couple thoughts. Looking forward to trying another statedb wrapper for witness gathering.

lightclient · 2024-10-21T17:45:45Z

core/state/statedb_hooked.go

+
+// hookedStateDB represents a statedb which emits calls to tracing-hooks
+// on state operations.
+type hookedStateDB struct {


tracingStateDB?

hookedStateDB means very little to me. I would have kinda expected it to have generic hooks support instead of just tracing hooks.

I initially called it that. But then I thought, it's using the *tracing.Hooks. I can go with either

core/state/statedb_hooked.go

namiloh · 2024-10-22T06:59:27Z

core/state/statedb_hooked.go

+	if !prev.IsZero() && changed {
+		if s.hooks.OnBalanceChange != nil {
+			s.hooks.OnBalanceChange(address, prev.ToBig(), new(big.Int), tracing.BalanceDecreaseSelfdestruct)


I suspect this is now wrong. This balancechange here only triggers for the decrease, so presumably theres a corresponding increase trigger somewhere. It may be that selfdestruct(self) now triggers the + but not the -. I'll check it once I have a functional IDE again.. Would be good to have this covered by a testcase

So, this happens prior

func opSelfdestruct6780(pc *uint64, interpreter *EVMInterpreter, scope *ScopeContext) ([]byte, error) { if interpreter.readOnly { return nil, ErrWriteProtection } beneficiary := scope.Stack.pop() balance := interpreter.evm.StateDB.GetBalance(scope.Contract.Address()) interpreter.evm.StateDB.SubBalance(scope.Contract.Address(), balance, tracing.BalanceDecreaseSelfdestruct) interpreter.evm.StateDB.AddBalance(beneficiary.Bytes20(), balance, tracing.BalanceIncreaseSelfdestruct) interpreter.evm.StateDB.Selfdestruct6780(scope.Contract.Address())

Regardless of who the recipient is (self or other), and whether the contract is new or old, both decrese and increase fires here.

The cases that exist are

new contract, beneficiary other:

decrease-event in opcode

increase-event in opcode

No events triggered in statedb.

new contract, beneficiary self:

decrease-event in opcode

increase-event in opcode

triggers decrease-event in statedb + sets to zero

old contract, beneficiary other:

decrease-event in opcode

increase-event in opcode

No events triggered in statedb.

old contract, beneficiary self:

decrease-event in opcode

increase-event in opcode

No events triggered in statedb.

I think all of these cases are handled correctly.

Actually, except for one thing. The 2.iii, when a new contract is self-destructed with self-recipient, then it should be traced as a burn, not a balancedecreasselfdestruct (beneficiary).

There are basically two burns: one immediate when a contract does selfdestruct-to-self, and one later on, which only ever happens if someone sends some funds to an already destructed account.

So as I see it, line 204 and 214 should use BalanceDecreaseSelfdestructBurn.

However, it seems @s1na has already considered this and decided to not handle it that way:

// BalanceDecreaseSelfdestructBurn is ether that is sent to an already self-destructed // account within the same tx (captured at end of tx). // Note it doesn't account for a self-destruct which appoints itself as recipient. BalanceDecreaseSelfdestructBurn BalanceChangeReason = 14

So, in other words, all good!

holiman mentioned this pull request Oct 10, 2024

core/tracing: v1.1 #30441

Open

holiman commented Oct 10, 2024

View reviewed changes

core/blockchain.go Outdated Show resolved Hide resolved

holiman force-pushed the shimmed_statedb branch from ec06411 to 4a25e24 Compare October 10, 2024 18:19

holiman force-pushed the shimmed_statedb branch from cae550b to 68a0aff Compare October 11, 2024 09:29

holiman marked this pull request as ready for review October 11, 2024 09:33

holiman requested review from lightclient, s1na, karalabe and rjl493456442 as code owners October 11, 2024 09:33

holiman changed the title ~~core/state: wip move state log mechanism in to a separate layer~~ core/state: move state log mechanism to a separate layer Oct 11, 2024

s1na reviewed Oct 11, 2024

View reviewed changes

core/state/statedb_logger.go Outdated Show resolved Hide resolved

holiman added the status:triage label Oct 15, 2024

rjl493456442 reviewed Oct 15, 2024

View reviewed changes

core/state/statedb_logger.go Outdated Show resolved Hide resolved

holiman force-pushed the shimmed_statedb branch from 0a4002e to 54d7790 Compare October 15, 2024 13:52

s1na reviewed Oct 16, 2024

View reviewed changes

core/state/statedb_logger.go Outdated Show resolved Hide resolved

s1na approved these changes Oct 17, 2024

View reviewed changes

holiman commented Oct 17, 2024

View reviewed changes

core/state/statedb_logger.go Outdated Show resolved Hide resolved

core/vm/interface.go Outdated Show resolved Hide resolved

lightclient reviewed Oct 17, 2024

View reviewed changes

core/state/statedb.go Outdated Show resolved Hide resolved

lightclient reviewed Oct 17, 2024

View reviewed changes

core/state/statedb.go Outdated Show resolved Hide resolved

lightclient reviewed Oct 17, 2024

View reviewed changes

core/state/statedb.go Outdated Show resolved Hide resolved

holiman added 2 commits October 18, 2024 16:58

core/state: add testcase for handling of burned balances

4de4fa4

holiman force-pushed the shimmed_statedb branch from 16fa089 to 8ca0cd5 Compare October 18, 2024 14:59

rjl493456442 self-assigned this Oct 20, 2024

rjl493456442 reviewed Oct 21, 2024

View reviewed changes

internal/ethapi/api.go Show resolved Hide resolved

rjl493456442 reviewed Oct 21, 2024

View reviewed changes

core/state/statedb_hooked.go Show resolved Hide resolved

rjl493456442 reviewed Oct 21, 2024

View reviewed changes

core/state/statedb.go Show resolved Hide resolved

rjl493456442 reviewed Oct 21, 2024

View reviewed changes

core/state/statedb_hooked.go Outdated Show resolved Hide resolved

core/state: fix several hook issues

21f5a77

rjl493456442 force-pushed the shimmed_statedb branch from 350d4bb to 21f5a77 Compare October 21, 2024 10:58

lightclient reviewed Oct 21, 2024

View reviewed changes

namiloh reviewed Oct 22, 2024

View reviewed changes

holiman added this to the 1.14.12 milestone Oct 22, 2024

core/state: more consistent naming

192e00b

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

core/state: move state log mechanism to a separate layer #30569

core/state: move state log mechanism to a separate layer #30569

holiman commented Oct 10, 2024 •

edited

Loading

holiman commented Oct 11, 2024 •

edited

Loading

holiman commented Oct 11, 2024

s1na commented Oct 11, 2024

holiman commented Oct 11, 2024 via email

fjl commented Oct 11, 2024

namiloh commented Oct 12, 2024

holiman commented Oct 12, 2024

s1na left a comment

fjl commented Oct 17, 2024

holiman commented Oct 18, 2024

rjl493456442 Oct 21, 2024

rjl493456442 Oct 21, 2024

namiloh Oct 22, 2024

namiloh Oct 22, 2024

rjl493456442 Oct 21, 2024

namiloh Oct 22, 2024

holiman commented Oct 21, 2024

rjl493456442 commented Oct 21, 2024

lightclient left a comment

lightclient Oct 21, 2024

namiloh Oct 22, 2024

namiloh Oct 22, 2024

namiloh Oct 22, 2024

namiloh Oct 22, 2024

core/state: move state log mechanism to a separate layer #30569

Are you sure you want to change the base?

core/state: move state log mechanism to a separate layer #30569

Conversation

holiman commented Oct 10, 2024 • edited Loading

holiman commented Oct 11, 2024 • edited Loading

holiman commented Oct 11, 2024

s1na commented Oct 11, 2024

holiman commented Oct 11, 2024 via email

fjl commented Oct 11, 2024

namiloh commented Oct 12, 2024

holiman commented Oct 12, 2024

s1na left a comment

Choose a reason for hiding this comment

fjl commented Oct 17, 2024

holiman commented Oct 18, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

holiman commented Oct 21, 2024

rjl493456442 commented Oct 21, 2024

lightclient left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

holiman commented Oct 10, 2024 •

edited

Loading

holiman commented Oct 11, 2024 •

edited

Loading