VKT: Optimizations #147

jsign · 2022-12-13T23:59:03Z

This PR contains more optimizations apart from the ones in:

The go.mod in this PR points at those branches; we should switch to the official commits once those PRs are merged.

Apart from that, this PR includes other optimizations, which are explained in the PR comments.

The improvement is significant when comparing beverly-hills against this PR:

name                               old time/op    new time/op    delta
TriesRandom/VKT/1000_accounts-16     56.8ms ± 3%    22.9ms ± 3%  -59.73%  (p=0.001 n=10+5)
TriesRandom/VKT/5000_accounts-16      264ms ± 2%      76ms ± 2%  -71.34%  (p=0.001 n=10+5)
TriesRandom/VKT/10000_accounts-16     535ms ± 2%     146ms ± 5%  -72.78%  (p=0.001 n=10+5)

name                               old alloc/op   new alloc/op   delta
TriesRandom/VKT/1000_accounts-16     9.56MB ± 0%    9.76MB ± 0%   +2.13%  (p=0.001 n=9+5)
TriesRandom/VKT/5000_accounts-16     43.5MB ± 0%    43.9MB ± 0%   +0.94%  (p=0.002 n=8+5)
TriesRandom/VKT/10000_accounts-16    87.5MB ± 0%    88.2MB ± 0%   +0.83%  (p=0.001 n=10+5)

name                               old allocs/op  new allocs/op  delta
TriesRandom/VKT/1000_accounts-16      32.8k ± 0%     35.5k ± 0%   +8.08%  (p=0.001 n=9+5)
TriesRandom/VKT/5000_accounts-16       151k ± 0%      144k ± 0%   -4.88%  (p=0.001 n=10+5)
TriesRandom/VKT/10000_accounts-16      303k ± 0%      285k ± 0%   -5.97%  (p=0.001 n=10+5)

Signed-off-by: Ignacio Hagopian <[email protected]>

go.mod

jsign · 2022-12-14T00:01:40Z

tests/tries_test.go

@@ -0,0 +1,76 @@
+package tests


These are the benchmarks from #146.
As we discussed, I removed only the statelessness one, so we avoid test flags.

jsign · 2022-12-14T00:03:02Z

trie/utils/verkle.go

-	trieIndexBytes := treeIndex.Bytes32()
-	verkle.FromBytes(&poly[3], trieIndexBytes[16:])
-	verkle.FromBytes(&poly[4], trieIndexBytes[:16])
+	if !treeIndex.IsZero() {
+		trieIndexBytes := treeIndex.Bytes32()
+		verkle.FromBytes(&poly[3], trieIndexBytes[16:])
+		verkle.FromBytes(&poly[4], trieIndexBytes[:16])
+	}


The first optimization skips the two conversions when treeIndex is zero, which makes sense and is a pretty common case.
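A minimal, standalone sketch of the fast-path idea in the hunk above. The `element` type, `fromBytes`, and `fillIndex` are hypothetical stand-ins, not the go-verkle API, and the sketch assumes the target slots already hold their zero value, so skipping the conversion leaves the same result.

```go
package main

import "fmt"

// element is a hypothetical stand-in for a field element.
type element [16]byte

// fromBytes is a hypothetical stand-in for the conversion in the diff.
func fromBytes(dst *element, src []byte) { copy(dst[:], src) }

// fillIndex writes the two 16-byte halves of idx into poly[3] and poly[4].
// It returns early when idx is zero; this assumes poly is zero-valued on
// entry, so both paths produce the same poly.
func fillIndex(poly *[5]element, idx [32]byte) {
	if idx == ([32]byte{}) {
		return // fast path: nothing to convert
	}
	fromBytes(&poly[3], idx[16:])
	fromBytes(&poly[4], idx[:16])
}

func main() {
	var zeroPoly, poly [5]element

	fillIndex(&zeroPoly, [32]byte{})

	var idx [32]byte
	idx[31] = 7
	fillIndex(&poly, idx)

	fmt.Println(zeroPoly[3] == element{}, poly[3][15])
}
```

The early return only pays off when zero indices are frequent, which the comment above says is the case here.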

trie/verkle.go

jsign · 2022-12-14T00:10:49Z

trie/verkle.go

-	resolver :=
-		func(h []byte) ([]byte, error) {
-			return trie.db.diskdb.Get(h)
-		}
+	resolver := func(h []byte) ([]byte, error) {
+		return trie.db.diskdb.Get(h)
+	}


jsign · 2022-12-14T00:16:42Z

trie/verkle.go

-	flush := make(chan verkle.VerkleNode)
+	type vnflush struct {
+		n     verkle.VerkleNode
+		value []byte
+		dbKey []byte
+	}
+	flush := make(chan vnflush, 1024)
 	resolver := func(n verkle.VerkleNode) {
-		flush <- n
+		value, err := n.Serialize()
+		if err != nil {
+			panic(err)
+		}
+		dbKey := nodeToDBKey(n)
+		flush <- vnflush{
+			n:     n,
+			value: value,
+			dbKey: dbKey,
+		}


Looking further at where the wall-clock time was going, I realized that after Commit()ing the underlying go-verkle trie, a significant amount of time was spent flushing the result.

In ethereum/go-verkle#314, I made Flush() on the root node do its work in parallel. This means the resolver is now called from multiple goroutines, spreading the CPU work across all available cores.

To squeeze out more, the Serialize() call was moved from L258 into the resolver. Serialize() now also runs in the goroutines that are flushing the result, rather than on a single core.

In a nutshell, since the resolver runs on all cores, we want to do as much work as possible there, so that the main goroutine ranging over the channel in L269 receives plain results ready to be stored in the diskdb. This main goroutine does no heavy CPU work; it just receives exactly what needs to be stored.

This matters because if the range in L269 is slow, it slows down everything. Note that I also made the channel buffered in L246. We want to avoid blocking goroutines as much as possible, and the extra breathing room also improved performance.
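The pattern described above can be sketched in isolation as follows. This is an illustration under stated assumptions, not the PR's code: `node`, `flushItem`, and `flushAll` are hypothetical names, a plain worker pool stands in for go-verkle's parallel Flush(), a map stands in for the diskdb, and `serialize` cannot fail here (the diff panics on a Serialize() error instead).

```go
package main

import (
	"encoding/binary"
	"fmt"
	"runtime"
	"sync"
)

// node is a hypothetical stand-in for verkle.VerkleNode.
type node struct{ id uint32 }

// serialize stands in for the CPU-heavy Serialize() call.
func (n node) serialize() []byte {
	b := make([]byte, 4)
	binary.BigEndian.PutUint32(b, n.id)
	return b
}

// flushItem carries data that is ready to be stored, like vnflush in the diff.
type flushItem struct {
	dbKey []byte
	value []byte
}

// flushAll serializes nodes in worker goroutines and stores the results from
// a single consumer loop that only performs cheap writes.
func flushAll(nodes []node, workers int) map[string][]byte {
	if workers < 1 {
		workers = 1
	}
	work := make(chan node)
	flush := make(chan flushItem, 1024) // buffered so producers rarely block

	var wg sync.WaitGroup
	for i := 0; i < workers; i++ {
		wg.Add(1)
		go func() {
			defer wg.Done()
			for n := range work {
				// The heavy work happens here, off the consumer goroutine.
				key := []byte(fmt.Sprintf("node-%d", n.id))
				flush <- flushItem{dbKey: key, value: n.serialize()}
			}
		}()
	}

	// Feed the workers, then close flush once every producer is done.
	go func() {
		for _, n := range nodes {
			work <- n
		}
		close(work)
		wg.Wait()
		close(flush)
	}()

	db := make(map[string][]byte, len(nodes))
	for item := range flush {
		db[string(item.dbKey)] = item.value
	}
	return db
}

func main() {
	nodes := make([]node, 10000)
	for i := range nodes {
		nodes[i] = node{id: uint32(i)}
	}
	db := flushAll(nodes, runtime.NumCPU())
	fmt.Println("stored entries:", len(db))
}
```

The consumer loop stays cheap because every item it receives already holds the final key and value; the buffer size of 1024 matches the diff, but the right value depends on the workload.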

gballet · 2022-12-16

So if #295 gets merged, Serialize will become a tad more expensive, in exchange for a cheaper deserialization (and hence not producing a tree that is crash-prone because of potentially invalid commitments, as in your current approach). Your change, I believe, will mitigate this problem and make it cheaper to follow this approach. That's very nice.

jsign · 2023-03-23T17:02:26Z

This could be interesting if we go ahead with ethereum/go-verkle#314, but for now I'll close this PR.
We can revive it if we go that route.

jsign added 3 commits December 13, 2022 20:50

tests: add mpt vs vkt insertion benchmarks

0fedef4

Signed-off-by: Ignacio Hagopian <[email protected]>

trie/utils: create a fast path if treeindex is zero

45dd248

Signed-off-by: Ignacio Hagopian <[email protected]>

trie/verkle: defer more work to parallelized flush, and avoid 64x mem…

10d1f47

…ory usage Signed-off-by: Ignacio Hagopian <[email protected]>

jsign changed the title ~~Jsign/benchktries~~ VKT: Optimizations Dec 13, 2022

mod: use optimized branches

e889d40

Signed-off-by: Ignacio Hagopian <[email protected]>

jsign force-pushed the jsign/benchktries branch from 13a685b to e889d40 Compare December 14, 2022 00:00

jsign mentioned this pull request Dec 14, 2022

COW in LeafNodes ethereum/go-verkle#314

Draft

jsign mentioned this pull request Dec 18, 2022

tests: add mpt vs vkt insertion benchmarks #146

Closed

jsign added 3 commits January 12, 2023 11:11

use updated go-verkle

fa84a2e

Signed-off-by: Ignacio Hagopian <[email protected]>

remove size optimization since go-verkle is not prepared for this ass…

e5e29aa

…umption Signed-off-by: Ignacio Hagopian <[email protected]>

update go-verkle

d3f3017

Signed-off-by: Ignacio Hagopian <[email protected]>

jsign marked this pull request as ready for review January 12, 2023 19:14

jsign requested a review from gballet January 12, 2023 19:14

jsign closed this Mar 23, 2023
