
Upstream v2.60.0 #168

Merged
merged 266 commits into from
May 17, 2024
Conversation


@ImTei ImTei commented May 9, 2024

domiwei and others added 30 commits March 21, 2024 11:47
- Update erigon-lib/gointerface
- Utilize `OverwriteSubscriptionExpiry` to reset expiry on subscription
for SentinelServer/SentinelClient
# Downloader lock design

The snapshot lock design has been changed to be more flexible. Before, it was
an empty file whose mere presence determined whether to skip snapshot
downloads; now this file has been extended a little.

We now treat the lock file as a JSON file which has the following
format:

```json
["headers", "bodies", etc...]
```

The strings are the stringified snapshot types:

```go
type Enum int

var Enums = struct {
	Unknown,
	Headers,
	Bodies,
	Transactions,
	BorEvents,
	BorSpans,
	BeaconBlocks,
	BlobSidecars Enum
}{ /* values elided */ }
```

After each download finishes, we push the enums of the downloaded snapshots
into the list (for a normal sync that is only `Headers`, `Bodies` and
`Transactions`, plus the Bor types).

When the node starts, downloading is prohibited for all the snapshot types
already listed in the lock file and remains open for the ones not in it.

---------

Co-authored-by: alex.sharov <[email protected]>
…#9779)

This is a low-priority issue.

We need the blobs corresponding to each block, but previously we were not
checking whether we had received all of them; now we keep asking until the
request returns the desired amount.
new_heads events were not correctly emitted during block forks. This fix
ensures accurate event generation and emission in fork scenarios.
Shortens the body encode/decode RLP code by extracting repeated (or similar)
parts into their own functions, improving reusability and readability (not
strictly necessary, just avoiding code repetition).
Cherry picked initial commit against devel. Beginning of discussion is
here erigontech#9750
There are bor-mainnet and sepolia files.
Also bumps some deps like `x` and `grpc`.
The `miner.recommit` flag wasn't registered earlier for use in the CLI. This
PR adds that.

Note: This is for validator support on Polygon PoS.
This PR also does the following additional things:

* Introduces correct JSON marshalling/unmarshalling for all objects
* `BaseFeePerGas` marshalled as an integer in JSON, Caplin-wise
* Reduced dumpSlotsStates from 32 to 4 for better performance during
reorgs
* Added a full lock for `OnAttestation`

## Block Production

This section highlights how `GET eth/v3/validator/blocks/{slot}` creates
a block and then publishes it.

The validator client executes 2 steps when producing a beacon block:

1) Production step: tell the beacon client to create a block and return
it.
2) Publishing step: Sign the block with the proposer private key, and
send it back for publishing to other nodes.


### Block creation

Let's first look at how block creation happens.

Caplin needs to do 2 things to successfully create a block:

1) Ask the Execution Layer for the execution block
2) Retrieve consensus operations to include in the block
(attestations, voluntary exits, etc...)

#### Execution block

The execution block is quite simple: we ask Erigon to produce a block through
the `AssembleBlock` function available on the Erigon `Eth1` API. We treat
Erigon as a black box, so we do not need to worry too much about this.
However, we also need to handle **blob** bundles, so that later, when we
publish a block, we can publish the bundles alongside it (it is important
that peers receive both the block and the blobs, or we will fail a check).
Erigon also gives us the bundle. Right now, we store the blob bundle in an
`LRU` sized to 8 blocks' worth of blobs. **Note: we use an LRU for its
convenient eviction policy**.

#### Operations

TODO.

Operations inclusion has not been implemented yet, the execution block
is the only thing being delivered.

### Block publishing

After we produce the beacon block, we will send it back to the Validator
Client, which will sign it and re-forward it to the rest of the network.

The flow is straightforward. When we receive the block, we simply:
1) Pack the block with the blobs the Execution Layer gave Caplin during
block production.
2) Start a separate thread where we import the block, alongside the blobs,
into Caplin's database and forkchoice.
3) Publish the blobs and blocks to the P2P network.
One-line summary: we must use pointer receivers for tx methods, because value
receivers copy the struct containing the `atomic.Value` fields, which must
not be copied. The `TransactionMisc` struct is embedded in every tx type, so
we must use pointer receivers for all tx methods.

For more context, struct `TransactionMisc` is defined as below:
```go
type TransactionMisc struct {
	// caches
	hash atomic.Value //nolint:structcheck
	from atomic.Value
}
```

`TransactionMisc` is embedded in the structs `AccessListTx`, `BlobTx`,
`BlobTxWrapper`, `DynamicFeeTransaction`, `CommonTx` and `LegacyTx`.
Methods on these structs tend to use a [value receiver, not a pointer
receiver](https://go.dev/tour/methods/8).

When a value-receiver method is called, the program copies the struct value
and uses the copy. `TransactionMisc` is embedded, so its fields `hash` and
`from` are copied too.

However, these fields are of type `atomic.Value`, which is [not allowed
to be copied](https://go.dev/src/sync/atomic/value.go) after first use.
This guideline is also mentioned in [Google's Go style
guide](https://google.github.io/styleguide/go/decisions#receiver-type).

Therefore we must use pointer receivers to avoid synchronization issues.
Using pointer receivers can also be more efficient when the receiver is a
large struct.

Co-authored-by: Andrew Ashikhmin <[email protected]>
This PR contains changes related to gathering information about the
"Bodies" stage.
The change list is:
- added entities for block download, write, process and processing
- added listeners and collected info for the above
- added an API to query this data
… too late checking for whitelist. need check before adding to lib (erigontech#9804)
This PR does the following:
* Implement correct handling of beacon proof handler (without
aggregation)
* Disable beacon aggregate and sync committee contribution gossip if not
in validator mode

Check implemented:
https://github.com/ethereum/consensus-specs/blob/dev/specs/phase0/p2p-interface.md#beacon_aggregate_and_proof
SonarCloud highlights duplicated code branches as bugs
The responsibility to maintain the status data is moved from the
stageloop Hook and MultiClient to the new StatusDataProvider. It reads
the latest data from a RoDB when asked. That happens at the end of each
stage loop iteration, and sometimes when any sentry stream loop
reconnects a sentry client.

sync.Service and MultiClient require an instance of the
StatusDataProvider now. The MessageListener is updated to depend on an
external statusDataFactory.
taratorio and others added 26 commits May 2, 2024 09:36
…ch#10164)

fixes a 2nd regression introduced by -
erigontech#7593

- it generates duplicate struct types in the same package (check
screenshot below)
- also found a better way to fix the first regression with unused
imports (improvement over
erigontech#10091)

<img width="1438" alt="Screenshot 2024-04-30 at 17 30 42"
src="https://github.com/ledgerwatch/erigon/assets/94537774/154d484b-4b67-4104-8a6e-eac2423e1c0e">
Cherry pick PR erigontech#10155 into the release

Co-authored-by: Dmytro <[email protected]>
…nt/GetHeader (erigontech#9786) (erigontech#9894)

* improved logging
* check ctx in ServeHTTP: The context might be cancelled if the client's
connection was closed while waiting for ServeHTTP.
* If execution API returns ExecutionStatus_Busy, limit retry attempts to
10 seconds. This timeout must be lower than a typical client timeout (30
sec), in order to give the client feedback about the server status.
* If execution API returns ExecutionStatus_Busy, increase retry delay
from 10 ms to 100 ms to avoid stalling ourselves with multiple busy
loops. IMO this delay should be higher (e.g. 1 sec). Ideally we
shouldn't do polling at all, but doing a blocking ctx call requires
rearchitecting the ExecutionStatus_Busy logic.

see erigontech#9786
Cherry pick PR erigontech#10187 into the release

Co-authored-by: Giulio rebuffo <[email protected]>
This PR brings the changes of erigontech#10195 to the branch release/2.60 with the
necessary modifications
Running a test every day doesn't make sense on an inactive branch.
It also seems that the schedule trigger favours the main branch when the
test workflow has the same name on main and other branches.
So this PR changes the test trigger to "push events".
Cherry pick PR erigontech#10214 into the release

Co-authored-by: Alex Sharov <[email protected]>
…#10224)

This adds torrent fixes that remove bad peers due to unhandled HTTP
errors.
Fixed starting the diagnostics server when the metrics address differs from
the pprof address.

---------

Co-authored-by: taratorio <[email protected]>
Fix for:
```
[p2p] Server                             protocol=68 peers=2 trusted=0 inbound=1 LOG15_ERROR= LOG15_ERROR= LOG15_ERROR= LOG15_ERROR= LOG15_ERROR= i/o timeout=53 EOF=65 closed by remote=215 too many peers=6 ecies: invalid message=5
```
@ImTei ImTei changed the title Upstream v2.60.0-rc1 Upstream v2.60.0 May 16, 2024
@ImTei ImTei requested review from mininny and pcw109550 May 16, 2024 00:27
@ImTei ImTei merged commit 4b1a315 into op-erigon May 17, 2024
5 of 6 checks passed