Appearance
Abstract
This EIP proposes a method to switch the state tree tree format from hexary Merkle Patricia Tree (MPT) to a verkle tree: the MPT tree is frozen, and new writes to the state are stored in a verkle tree “laid over” the hexary MPT. The historical MPT state is left untouched and its eventual migration is handled at a later time.
Motivation
The Ethereum state is growing, and verkle trees offer a good mitigation strategy to stem this growth and enable weak statelessness. Owing to the difficulty of translating contracts with large storage while they are being accessed, proposals for migrating the current MPT state are complex and will require client teams to undergo a long process of refactoring their code to handle this conversion.
The bigger the state, the longer any conversion process will take. This has an impact both while the conversion is happening, as well as when full-syncing the chain if the conversion is part of consensus. Fullsync is used extensively by core dev teams to test the performance of new code. A conversion longer than a month will impact the release schedule of client teams who typically release at this rate. Nodes that cannot follow the conversion will need to wait longer to rejoin. The conversion will also make reorg slower, so reducing its duration is desirable.
This current proposal suggests to stop the MPT state growth in its tracks by activating a new “overlay” verkle tree, that all new state updates are written to. The “base” MPT tree is frozen in place, until all execution clients are ready to perform the full transition. Data is read first from the overlay tree, and if not found there, from the MPT.
Whenever the block that freeze the MPT is finalized, internal node data can be deleted, in order to free up disk space.
Specification
The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", "SHOULD", "SHOULD NOT", "RECOMMENDED", "NOT RECOMMENDED", "MAY", and "OPTIONAL" in this document are to be interpreted as described in RFC 2119 and RFC 8174.
Constants
Parameter | value | Description |
---|---|---|
FORK_TIME | TDB | Time at which the verkle, overlay tree is activated. |
Helper functions
python3
# Determine if `block` is the fork activation block
def is_fork_block(block):
return block.parent.timestamp < FORK_TIME && block.timestamp >= FORK_TIME
# Write an account in the verkle tree
def verkle_set_account(tree: VerkleTree, key: Bytes32, account: Optional[Account]):
if account is not None:
versionkey = key
tree.set(versionkey, 0)
balancekey = key
balancekey[31] = BALANCE_LEAF_KEY
tree.set(balancekey, account.balance)
noncekey = key
noncekey[31] = NONCE_LEAF_KEY
tree.set(noncekey, account.nonce)
ckkey = key
ckkey[31] = CODE_KECCAK_LEAF_KEY
tree.set(ckkey, account.code_hash)
cskey = key
cskey[31] = CODE_SIZE_LEAF_KEY
tree.set(cskey, len(account.code))
# Reads an account from the verkle tree
def verkle_get_account(tree: VerkleTree, key: Bytes32) -> Optional[Account]:
version_leaf = tree.get(key)
if version_leaf is not None:
balancekey = key
balancekey[31] = BALANCE_LEAF_KEY
balance = tree.get(balancekey, account.balance)
noncekey = key
noncekey[31] = NONCE_LEAF_KEY
nonce = tree.get(noncekey)
ckkey = key
ckkey[31] = CODE_KECCAK_LEAF_KEY
ck = tree.get(ckkey)
cskey = key
cskey[31] = CODE_SIZE_LEAF_KEY
cs = tree.set(cskey)
account = Account(0, balance, nonce, ck, cs)
return account
Changes to the execution spec:
In the execution spec, modify the State
class as such:
python3
@dataclass
class State:
"""
Contains all information that is preserved between transactions.
"""
_main_trie: Trie[Address, Optional[Account]] = field(
default_factory=lambda: Trie(secured=True, default=None)
)
_storage_tries: Dict[Address, Trie[Bytes, U256]] = field(
default_factory=dict
)
_snapshots: List[
Tuple[
Trie[Address, Optional[Account]], Dict[Address, Trie[Bytes, U256]]
]
] = field(default_factory=list)
_created_accounts: Set[Address] = field(default_factory=set)
# Added in this EIP
_overlay_tree: VerkleTree[Address, Bytes32]
And the state access functions are modified as such:
python3
def get_account_optional(state: State, address: Address) -> Optional[Account]:
account = verkle_get_account(state._overlay_tree, get_tree_key_for_version(addr))
if account is not None:
return account
return trie_get(state._main_trie, address)
def set_account(state: State, address: Address, account: Optional[Account]) -> None:
verkle_set_account(state._overlay_tree, get_tree_key_for_nonce(addr), account)
def get_storage(state: State, address: Address, key: Bytes) -> U256:
value = state._overlay_tree.get(get_tree_key_for_storage_slot(addr, slot))
if value is not None:
return value
trie = state._storage_tries.get(address)
if trie is None:
return U256(0)
value = trie_get(trie, key)
assert isinstance(value, U256)
return value
def set_storage(
state: State, address: Address, key: Bytes, value: U256
) -> None:
state._overlay_tree.set(get_tree_key_for_storage_slot(addr, slot), value)
Rationale
This approach doesn't convert the state, which is left to a subsequent EIP. This is meant as a stopgap in case we decide to push the conversion itself to a later time. It has the advantage of simplicity, which means that the Verge fork could happen at the same time as other, simpler EIPs. It also requires no change at the consensus layer.
Backwards Compatibility
No backward compatibility issues found.
Test Cases
Reference Implementation
transition-post-genesis
branch ingithub.com/gballet/go-ethereum
implements this when setting--override.overlay-stride=0
on the command line.
Security Considerations
Needs discussion.
Copyright
Copyright and related rights waived via CC0.