
Conversation

@rjl493456442 (Member) commented May 8, 2020

This PR offers two commands: geth snapshot prune-state and geth snapshot verify-state.
Both commands require a live snapshot. To use them, first generate the snapshot
by running with --snapshot enabled.

State pruner

It's a very simple state pruner, and the idea is quite straightforward: whenever we have a live snapshot, we can regenerate the whole state trie from it. Users can pick a specific snapshot version for state regeneration and then wipe all other trie nodes for pruning.

The pruning procedure consists of the following steps (steps 3 and 4 are sketched in code after the list):

  • Generate the state trie from a specific snapshot
  • Commit all trie nodes as well as the account codes to a file-based temporary database
  • Iterate the main database and delete all state data (including the account codes), keeping the genesis state
  • Compact the whole main database to release disk space
  • Migrate all data from the temporary database to the main one
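
A minimal sketch of the delete-and-compact steps against a raw goleveldb handle. The `isStateKey` heuristic, batch threshold, and `chaindata` path are assumptions for illustration; the real pruner works through geth's ethdb wrappers and explicitly spares the genesis state:

```go
package main

import (
	"log"

	"github.com/syndtr/goleveldb/leveldb"
	"github.com/syndtr/goleveldb/leveldb/util"
)

// isStateKey is an assumed heuristic: trie nodes are keyed by their 32-byte
// hash, so anything with a 32-byte key is treated as state here. The real
// pruner is more careful and also keeps the genesis state.
func isStateKey(key []byte) bool {
	return len(key) == 32
}

// pruneAndCompact implements steps 3-4: delete state entries in batches,
// then compact the whole key space to actually reclaim disk space.
func pruneAndCompact(db *leveldb.DB) error {
	it := db.NewIterator(nil, nil)
	defer it.Release()

	batch := new(leveldb.Batch)
	for it.Next() {
		if !isStateKey(it.Key()) {
			continue
		}
		batch.Delete(it.Key())
		if batch.Len() >= 10000 { // flush periodically to bound memory
			if err := db.Write(batch, nil); err != nil {
				return err
			}
			batch.Reset()
		}
	}
	if err := it.Error(); err != nil {
		return err
	}
	if err := db.Write(batch, nil); err != nil {
		return err
	}
	return db.CompactRange(util.Range{}) // zero Range = the full key space
}

func main() {
	db, err := leveldb.OpenFile("chaindata", nil)
	if err != nil {
		log.Fatal(err)
	}
	defer db.Close()
	if err := pruneAndCompact(db); err != nil {
		log.Fatal(err)
	}
}
```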

The following scenarios can happen (see the recovery sketch after this list):

  • The system exits before we regenerate the state trie: the temporary database is incomplete and will be deleted when the system launches next time.
  • The system exits after we regenerate the state trie: the temporary database is complete, and, more importantly, state data may already have been pruned from the main database. So when the system launches next time, the temporary database will be migrated into the main one and then wiped.
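
A sketch of the implied startup recovery. Every name here is hypothetical; a `COMPLETE` marker file stands in for however the real pruner records that regeneration finished:

```go
package prune

import (
	"os"
	"path/filepath"
)

// isComplete reports whether the temporary database finished regeneration.
// Hypothetical scheme: a marker file written once the trie is fully rebuilt.
func isComplete(tmpDBPath string) bool {
	_, err := os.Stat(filepath.Join(tmpDBPath, "COMPLETE"))
	return err == nil
}

// migrate copies all data from the temporary database into the main one.
// Stubbed here; the real migration streams every key/value pair across.
func migrate(tmpDBPath, mainDBPath string) error { return nil }

// recoverPruning sketches the startup logic implied by the two scenarios.
func recoverPruning(tmpDBPath, mainDBPath string) error {
	if _, err := os.Stat(tmpDBPath); os.IsNotExist(err) {
		return nil // no interrupted pruning run
	}
	if !isComplete(tmpDBPath) {
		// Crashed before the trie was regenerated: the main DB is intact,
		// so just drop the partial temporary database.
		return os.RemoveAll(tmpDBPath)
	}
	// Crashed after regeneration: the main DB may already be partially
	// pruned, so finish the migration, then delete the temporary DB.
	if err := migrate(tmpDBPath, mainDBPath); err != nil {
		return err
	}
	return os.RemoveAll(tmpDBPath)
}
```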

One important note:

  • The whole pruning procedure can take a very long time (e.g. 7 minutes on Goerli)

State verifier

The verifier is a simple tool that checks whether the regenerated state trie hash equals the original one. For now it's mainly used for testing; the core check is sketched below.
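
A rough sketch of that check, written against stand-in interfaces rather than geth's actual types. In geth the accumulator role is played by trie.StackTrie, and a full verifier must also rebuild each account's storage trie, which is glossed over here:

```go
package verify

import (
	"bytes"
	"fmt"
)

// hasher abstracts a stack-trie style accumulator: feed sorted (key, value)
// pairs, read back the root. This interface is a stand-in, not geth's API.
type hasher interface {
	Update(key, value []byte) error
	Hash() []byte
}

// kvIterator abstracts iteration over the flat snapshot (sorted by key).
type kvIterator interface {
	Next() bool
	Key() []byte
	Value() []byte
}

// verifyState rebuilds the state root from the snapshot and compares it
// against the root recorded in the block header.
func verifyState(snap kvIterator, h hasher, headerRoot []byte) error {
	for snap.Next() {
		if err := h.Update(snap.Key(), snap.Value()); err != nil {
			return err
		}
	}
	if got := h.Hash(); !bytes.Equal(got, headerRoot) {
		return fmt.Errorf("state root mismatch: have %x, want %x", got, headerRoot)
	}
	return nil
}
```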

What's more?

Beyond these two commands, we can offer more functionality based on the snapshot. One idea is that the snapshot can be used to generate trie nodes for arbitrary ranges. If so, we can use it for state repair: whenever we hit a missing-trie-node error, we can just regenerate the node instead of throwing away the whole database.

Comment on lines +364 to +366

Contributor

Wouldn't it be better to return an error instead of just exiting on errors? (re this location and all others)

@rjl493456442 (Member, Author)

Running the pruning on benchmark machine 01:

  • Database size: 284G
    • Ancient: 148G
    • LevelDB: 136G
  • Running time: 3h56m10.068s
    • Generate state from snapshot: 59m13.572s
    • Prune LevelDB: 29m9.624s
    • Compact after pruning: 46m55.942s
    • Migrate the generated state: 1h40m47.876s

@rjl493456442 force-pushed the simple-pruner branch 2 times, most recently from accad99 to 3fe484c on October 14, 2020
@holiman (Contributor) commented Oct 15, 2020

As I understand it, this is the current scheme:

  1. Iterate snapshots,
    • pipe them into stacktrie,
      • write states (key+value) into filedb,
  2. iterate leveldb
    • delete states from leveldb
    • range-compact of leveldb
  3. Write back states into leveldb
  4. Delete filedb

This scheme basically empties most of LevelDB out and writes everything back again, which is very heavy I/O.
An alternative would be to operate on hashes only, and not delete-then-write-back:

  1. Iterate snapshots,
    • pipe them into stacktrie,
      • write keys (hashes) into filedb,
  2. iterate leveldb
    • delete state if key is not present in filedb

Size of the state keys, assuming the unpruned state is around 2x, with 1G keys: 1G * 32 bytes = 32GB.
So in the second step, we essentially have to find out whether
a given key (32 bytes) is present among 1,000,000,000 other keys.

EDIT: the key count is really the size of the to-be-kept state, not the unpruned size. It won't be 1G keys, but somewhere on the order of 600M on mainnet right now.

However, we don't actually have to be fully accurate. If we use a bloom filter with
an error rate of N, it just means our deletion will fail to delete N% of the entries.

As far as I can tell, a bloom filter of ~1.84GB, with 11 hash functions, would give us an
error rate of 0.05%: https://hur.st/bloomfilter/?n=1000000000&p=0.0005&m=&k=
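
Those figures match the standard Bloom filter sizing formulas m = -n·ln(p)/(ln 2)² bits and k = (m/n)·ln 2 hash functions, quickly checked:

```go
package main

import (
	"fmt"
	"math"
)

func main() {
	n := 1e9    // expected number of keys
	p := 0.0005 // target false-positive rate (0.05%)

	// Standard Bloom filter sizing: m bits, k hash functions.
	m := -n * math.Log(p) / (math.Ln2 * math.Ln2)
	k := m / n * math.Ln2

	fmt.Printf("m = %.2f GiB, k = %.1f\n", m/8/(1<<30), k)
	// Output: m = 1.84 GiB, k = 11.0
}
```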

So we'd wind up with (sketched in code after the list):

  1. Iterate snapshots,
    • pipe them into stacktrie,
      • write keys into bloom filter,
  2. iterate leveldb
    • delete state if key is not present in filter
  3. Compact
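
A hedged sketch of the filter variant, reusing the goleveldb calls from the earlier sketch. The tiny bloom implementation is illustrative only; it exploits that state keys are already uniform 32-byte hashes, so two slices of the key itself drive the k probes via double hashing:

```go
package prune

import (
	"encoding/binary"

	"github.com/syndtr/goleveldb/leveldb"
	"github.com/syndtr/goleveldb/leveldb/util"
)

// bloom is a minimal Bloom filter for 32-byte state keys. For the sizing in
// the comment above it would be ~1.84 GiB of bits with k=11.
type bloom struct {
	bits []byte
	k    int
}

func newBloom(mBits uint64, k int) *bloom {
	return &bloom{bits: make([]byte, mBits/8), k: k}
}

// indexes derives k bit positions via double hashing over two 8-byte slices
// of the (already uniformly distributed) 32-byte key.
func (b *bloom) indexes(key []byte) []uint64 {
	h1 := binary.BigEndian.Uint64(key[:8])
	h2 := binary.BigEndian.Uint64(key[8:16])
	m := uint64(len(b.bits)) * 8
	idx := make([]uint64, b.k)
	for i := range idx {
		idx[i] = (h1 + uint64(i)*h2) % m
	}
	return idx
}

func (b *bloom) Add(key []byte) {
	for _, i := range b.indexes(key) {
		b.bits[i/8] |= 1 << (i % 8)
	}
}

func (b *bloom) Contains(key []byte) bool {
	for _, i := range b.indexes(key) {
		if b.bits[i/8]&(1<<(i%8)) == 0 {
			return false
		}
	}
	return true
}

// pruneWithBloom deletes every 32-byte state key not present in the filter.
// A false positive just means that entry survives until the next pruning.
func pruneWithBloom(db *leveldb.DB, keep *bloom) error {
	it := db.NewIterator(nil, nil)
	defer it.Release()

	batch := new(leveldb.Batch)
	for it.Next() {
		key := it.Key()
		if len(key) != 32 || keep.Contains(key) {
			continue
		}
		batch.Delete(key)
		if batch.Len() >= 10000 {
			if err := db.Write(batch, nil); err != nil {
				return err
			}
			batch.Reset()
		}
	}
	if err := db.Write(batch, nil); err != nil {
		return err
	}
	return db.CompactRange(util.Range{})
}
```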

It's a very different approach; I'd be curious to see the performance difference between these two ways of doing it.
Incidentally, we have a bloom filter for more or less the same purpose (trie.SyncBloom) that we use in the downloader when downloading state.
