Resharding V3 - add a few state sync details #573

marcelo-gonzalez · 2024-11-13T02:29:56Z

No description provided.

wacban · 2024-11-13T15:55:13Z

neps/nep-0568.md

+When nodes sync state (either because they've fallen far behind the chain, or because they're going to become a chunk producer for a new shard in a future epoch), they first identify a point in the chain they'd like to sync to. This is always the first block of the current epoch, which the node should be aware of once it has synced headers to the current point in the chain. The hash of this first block is referred to as the "sync_hash" in many places in the state sync implementation. Then the node makes a request (currently to centralized storage on GCS, but in the future to other nodes in the network) for a `ShardStateSyncResponseHeader` corresponding to that "sync_hash" and
+the Shard ID of the shard it's interested in. Among other things, this header includes the last new chunk before "sync_hash" in the shard, and a `StateRootNode` with hash equal to that chunk's `prev_state_root` field. Then the node downloads (again from GCS, but in the future it'll be from other nodes) the nodes of the trie with that `StateRootNode` as its root. Afterwards, it applies new chunks in the shard until it's caught up.


This section may be a bit too technical for the specification. Perhaps you can move the implementation details to the "reference implementation" section below and here keep it at a higher level?

yea good point, done

wacban

LGTM

After merging into the main PR please see if there are any lint errors and fix those. For some reason those only show up on the main RP.

wacban · 2024-11-15T10:46:13Z

neps/nep-0568.md

+
+The state sync algorithm defines a `sync_hash` that is used in many parts of the implementation. This is always the first block of the current epoch, which the node should be aware of once it has synced headers to the current point in the chain. A node performing state sync first makes a request (currently to centralized storage on GCS, but in the future to other nodes in the network) for a `ShardStateSyncResponseHeader` corresponding to that `sync_hash` and the Shard ID of the shard it's interested in. Among other things, this header includes the last new chunk before `sync_hash` in the shard, and a `StateRootNode` with hash equal to that chunk's `prev_state_root` field. Then the node downloads (again from GCS, but in the future it'll be from other nodes) the nodes of the trie with that `StateRootNode` as its root. Afterwards, it applies new chunks in the shard until it's caught up.
+
+ As described above, the state we download is the state in the shard after applying the second to last new chunk before `sync_hash`, which belongs to the previous epoch (since `sync_hash` is the first block of the new epoch). To move the point in the chain of the initial state download to the current epoch, we could either move the `sync_hash` forward or we could change the state sync protocol (perhaps changing the meaning of the `sync_hash` and the fields of the `ShardStateSyncResponseHeader`, or somehow changing these structures more significantly). The former is an easier first implementation, since it would not require any changes to the state sync protocol other than to the expected `sync_hash`. We would just need to move the `sync_hash` to a point far enough along in the chain so that the `StateRootNode` in the `ShardStateSyncResponseHeader` refers to state in the current epoch.


Can you clarify which one do we want to do?

Resharding V3 - add a few state sync details

39659dc

marcelo-gonzalez requested a review from wacban November 13, 2024 02:29

marcelo-gonzalez requested a review from a team as a code owner November 13, 2024 02:29

wacban reviewed Nov 13, 2024

View reviewed changes

move implementation details to the implementation section

b2df47a

marcelo-gonzalez requested a review from wacban November 14, 2024 22:01

wacban approved these changes Nov 15, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Resharding V3 - add a few state sync details #573

Resharding V3 - add a few state sync details #573

marcelo-gonzalez commented Nov 13, 2024

wacban Nov 13, 2024

marcelo-gonzalez Nov 14, 2024

wacban left a comment

wacban Nov 15, 2024

		When nodes sync state (either because they've fallen far behind the chain, or because they're going to become a chunk producer for a new shard in a future epoch), they first identify a point in the chain they'd like to sync to. This is always the first block of the current epoch, which the node should be aware of once it has synced headers to the current point in the chain. The hash of this first block is referred to as the "sync_hash" in many places in the state sync implementation. Then the node makes a request (currently to centralized storage on GCS, but in the future to other nodes in the network) for a `ShardStateSyncResponseHeader` corresponding to that "sync_hash" and
		the Shard ID of the shard it's interested in. Among other things, this header includes the last new chunk before "sync_hash" in the shard, and a `StateRootNode` with hash equal to that chunk's `prev_state_root` field. Then the node downloads (again from GCS, but in the future it'll be from other nodes) the nodes of the trie with that `StateRootNode` as its root. Afterwards, it applies new chunks in the shard until it's caught up.


		The state sync algorithm defines a `sync_hash` that is used in many parts of the implementation. This is always the first block of the current epoch, which the node should be aware of once it has synced headers to the current point in the chain. A node performing state sync first makes a request (currently to centralized storage on GCS, but in the future to other nodes in the network) for a `ShardStateSyncResponseHeader` corresponding to that `sync_hash` and the Shard ID of the shard it's interested in. Among other things, this header includes the last new chunk before `sync_hash` in the shard, and a `StateRootNode` with hash equal to that chunk's `prev_state_root` field. Then the node downloads (again from GCS, but in the future it'll be from other nodes) the nodes of the trie with that `StateRootNode` as its root. Afterwards, it applies new chunks in the shard until it's caught up.

		As described above, the state we download is the state in the shard after applying the second to last new chunk before `sync_hash`, which belongs to the previous epoch (since `sync_hash` is the first block of the new epoch). To move the point in the chain of the initial state download to the current epoch, we could either move the `sync_hash` forward or we could change the state sync protocol (perhaps changing the meaning of the `sync_hash` and the fields of the `ShardStateSyncResponseHeader`, or somehow changing these structures more significantly). The former is an easier first implementation, since it would not require any changes to the state sync protocol other than to the expected `sync_hash`. We would just need to move the `sync_hash` to a point far enough along in the chain so that the `StateRootNode` in the `ShardStateSyncResponseHeader` refers to state in the current epoch.

Resharding V3 - add a few state sync details #573

Are you sure you want to change the base?

Resharding V3 - add a few state sync details #573

Conversation

marcelo-gonzalez commented Nov 13, 2024

wacban Nov 13, 2024

Choose a reason for hiding this comment

marcelo-gonzalez Nov 14, 2024

Choose a reason for hiding this comment

wacban left a comment

Choose a reason for hiding this comment

wacban Nov 15, 2024

Choose a reason for hiding this comment