perf: single solver shared by multiple paths #218

daejunpark · 2023-11-10T19:51:08Z

share a single solver for all paths
full dfs path exploration across external call boundaries
use z3 push/pop to avoid the solver reinstantiation overhead during the dfs
generate counterexamples in parallel with the dfs
a new option for early exit upon finding the first counterexample
a new option to set the number of threads for parallel solvers
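
A minimal sketch of the push/pop idea, using a toy stand-in for the solver rather than z3 itself. `ToySolver` and `explore` are hypothetical names for illustration, not halmos code; the point is only that one solver object serves every path because each branch's assertions are scoped and then discarded on backtrack.

```python
from typing import Iterator, List, Tuple


class ToySolver:
    """Toy stand-in for an incremental solver: a stack of assertion scopes."""

    def __init__(self) -> None:
        self.scopes: List[List[str]] = [[]]  # scopes[0] is the base scope

    def push(self) -> None:
        self.scopes.append([])

    def pop(self, num: int = 1) -> None:
        # drop the `num` most recent scopes; the base scope always stays
        if not 0 < num < len(self.scopes):
            raise ValueError("invalid number of scopes to pop")
        del self.scopes[-num:]

    def add(self, cond: str) -> None:
        self.scopes[-1].append(cond)

    def assertions(self) -> List[str]:
        return [cond for scope in self.scopes for cond in scope]


def explore(
    solver: ToySolver, branches: List[Tuple[str, str]], depth: int = 0
) -> Iterator[List[str]]:
    """DFS over two-way branches, reusing one solver via push/pop."""
    if depth == len(branches):
        yield solver.assertions()  # snapshot of the current path condition
        return
    for cond in branches[depth]:
        solver.push()  # open a scope for this branch
        solver.add(cond)
        yield from explore(solver, branches, depth + 1)
        solver.pop()  # backtrack: discard this branch's assertions


solver = ToySolver()
paths = list(explore(solver, [("x > 0", "x <= 0"), ("y == 1", "y != 1")]))
```

After the traversal the solver is back to its base scope, so no solver had to be created or copied per path.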

note: the farcaster tests are temporarily disabled in ci, due to unrelated issues that will be fixed in a separate pr

0xkarmacoma · 2023-11-28T00:10:13Z

src/halmos/sevm.py

@@ -815,7 +864,7 @@ def sha3_data(self, data: Bytes, size: int) -> Word:
        sha3_expr = f_sha3(data)

        # assume hash values are sufficiently smaller than the uint max
-        self.solver.add(ULE(sha3_expr, con(2**256 - 2**64)))
+        self.path.append(ULE(sha3_expr, con(2**256 - 2**64)))


do you think it would be useful to expose an additional method like add_assertion? Right now path.append looks like the way to add a condition to the solver assertions, but as a side effect the condition is also appended to the list of human-readable path conditions, even when it doesn't really correspond to a path (branching) condition

or just path.assert(cond) for short

good point! added a todo note to get back to this later.
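
A rough sketch of the split suggested in this thread, under the assumption that a path holds both a solver and a readable list of conditions. `PathSketch` and `RecordingSolver` are hypothetical; the method is named `add_assertion` because `assert` is a reserved word in Python and cannot be used as a method name.

```python
from typing import List


class RecordingSolver:
    """Hypothetical stand-in that only records what was added to it."""

    def __init__(self) -> None:
        self.added: List[str] = []

    def add(self, cond: str) -> None:
        self.added.append(cond)


class PathSketch:
    def __init__(self, solver: RecordingSolver) -> None:
        self.solver = solver
        self.conditions: List[str] = []  # human-readable branching conditions

    def append(self, cond: str) -> None:
        """Branching condition: sent to the solver and kept for display."""
        self.solver.add(cond)
        self.conditions.append(cond)

    def add_assertion(self, cond: str) -> None:
        """Background assumption: sent to the solver only."""
        self.solver.add(cond)


path = PathSketch(RecordingSolver())
path.append("x > 0")
path.add_assertion("sha3(data) <= 2**256 - 2**64")
```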

0xkarmacoma · 2023-11-28T00:26:28Z

src/halmos/sevm.py

@@ -596,6 +643,7 @@ class Exec:  # an execution path

    # tx
    context: CallContext
+    callback: Any  # to be called when returning back to the parent context


can we use something like Optional[Callable] as the type hint here?

0xkarmacoma · 2023-11-28T00:29:25Z

src/halmos/sevm.py

        self.context = kwargs["context"]
+        self.callback = kwargs["callback"]


given that callback is optional (can be None), can we also make it optional in the kwargs?

something like self.callback = kwargs.get("callback", None)

the callback is None only at the top level. i prefer to keep it non-optional in the kwargs, so that i can catch it earlier if callback is missing by mistake.
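
A small sketch of how the two threads above can coexist: the value is typed as optional, while the key itself stays required. `ExecSketch` is a hypothetical reduction of the class, not the actual `Exec`.

```python
from typing import Any, Callable, Optional


class ExecSketch:
    # the value may be None (top level), but the key must always be passed
    callback: Optional[Callable[..., Any]]

    def __init__(self, **kwargs: Any) -> None:
        # indexing instead of .get() raises KeyError when the key is missing
        self.callback = kwargs["callback"]


top_level = ExecSketch(callback=None)

try:
    ExecSketch()  # callback forgotten by mistake
    missing_detected = False
except KeyError:
    missing_detected = True
```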

0xkarmacoma · 2023-11-28T00:35:18Z

src/halmos/sevm.py

                    if opcode == EVM.JUMPI:
-                        steps[step_id] = {"parent": prev_step_id, "exec": str(ex)}
+                        self.steps[step_id] = {"parent": prev_step_id, "exec": str(ex)}
                    # elif opcode == EVM.CALL:
-                    #     steps[step_id] = {'parent': prev_step_id, 'exec': str(ex) + ex.st.str_memory() + '\n'}
+                    #     self.steps[step_id] = {'parent': prev_step_id, 'exec': str(ex) + ex.st.str_memory() + '\n'}
                    else:
-                        # steps[step_id] = {'parent': prev_step_id, 'exec': ex.summary()}
-                        steps[step_id] = {"parent": prev_step_id, "exec": str(ex)}
+                        # self.steps[step_id] = {'parent': prev_step_id, 'exec': ex.summary()}
+                        self.steps[step_id] = {"parent": prev_step_id, "exec": str(ex)}


not directly related to this PR, but it looks like both branches are doing the same thing and could be simplified

good catch! it's legacy code, and i need to clean this up anyway; will do this later.

0xkarmacoma · 2023-11-28T00:39:17Z

src/halmos/sevm.py

+                if ex.callback is None:
+                    yield ex
+                else:
+                    yield from ex.callback(ex, stack, step_id)


can we extract these 4 lines into a helper function? They seem to be duplicated a couple of times

a comment might help with understanding the yield from ex.callback(...), it takes some thinking to figure out 🧙‍♂️
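
One possible shape for such a helper, with the comment the thread asks for. This is a sketch under the assumption that the callback is a generator function taking `(ex, stack, step_id)`; `finish`, `resume_parent`, and `ExecSketch` are hypothetical names.

```python
from typing import Any, Callable, Iterator, List, Optional


class ExecSketch:
    def __init__(
        self, name: str, callback: Optional[Callable[..., Iterator["ExecSketch"]]]
    ) -> None:
        self.name = name
        self.callback = callback


def finish(ex: ExecSketch, stack: List[Any], step_id: int) -> Iterator[ExecSketch]:
    if ex.callback is None:
        # top-level context: this execution is itself a final result
        yield ex
    else:
        # nested context: hand control back to the parent, which continues
        # running and yields whatever executions result from that
        yield from ex.callback(ex, stack, step_id)


def resume_parent(
    child: ExecSketch, stack: List[Any], step_id: int
) -> Iterator[ExecSketch]:
    # hypothetical parent continuation: build the parent's next state
    parent = ExecSketch(f"parent-after-{child.name}", callback=None)
    yield from finish(parent, stack, step_id)


top_results = [ex.name for ex in finish(ExecSketch("top", None), [], 0)]
nested_results = [ex.name for ex in finish(ExecSketch("child", resume_parent), [], 0)]
```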

0xkarmacoma · 2023-11-28T00:44:43Z

src/halmos/sevm.py

+        if len(self.pending) > 0:
+            raise ValueError("deepcopy pending path", self)
+
+        path = Path(self.solver)


might be worth a comment here. This is where the real magic happens since we re-use the solver instance instead of actually copying it, right? Maybe explain in a sentence or two why we do this and the trade-offs
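
A sketch of what such a copy might look like, assuming the hunk above sits inside a `__deepcopy__` hook. `PathSketch` and its `conditions` field are hypothetical; only the pending check and the reuse of `self.solver` come from the diff.

```python
from copy import deepcopy
from typing import Any, Dict, List


class PathSketch:
    def __init__(self, solver: Any) -> None:
        self.solver = solver
        self.conditions: List[str] = []
        self.pending: List[str] = []

    def __deepcopy__(self, memo: Dict[int, Any]) -> "PathSketch":
        if len(self.pending) > 0:
            raise ValueError("deepcopy pending path", self)
        # the solver is shared by reference, not copied: cheap to fork,
        # but the paths must then take turns using it
        path = PathSketch(self.solver)
        path.conditions = self.conditions.copy()
        return path


shared_solver = object()  # placeholder for a real solver instance
original = PathSketch(shared_solver)
original.conditions.append("x > 0")

forked = deepcopy(original)
forked.conditions.append("y == 1")
```

The trade-off in this sketch is that forking costs only a list copy, while correctness depends on the caller keeping the shared solver's state in sync with whichever path is currently active.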

0xkarmacoma · 2023-11-28T00:51:14Z

src/halmos/sevm.py

-        new_path = ex.path.copy()
-        new_path.append(str(cond))
+        new_path = deepcopy(ex.path)
+        new_path.pending.append(cond)


style: I had to hunt for this, I couldn't immediately see in Path where things were added to pending. Would be mildly cleaner/easier with an add_pending(cond) method IMO
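
A sketch of the suggested method. `add_pending` is the name proposed above; `activate` is purely an assumption about how pending conditions might later be promoted, and `PathSketch` is hypothetical.

```python
from typing import List


class PathSketch:
    def __init__(self) -> None:
        self.conditions: List[str] = []
        self.pending: List[str] = []

    def add_pending(self, cond: str) -> None:
        """Single, greppable entry point for queueing a condition."""
        self.pending.append(cond)

    def activate(self) -> None:
        """Assumed step: promote queued conditions once the path is resumed."""
        self.conditions.extend(self.pending)
        self.pending.clear()


new_path = PathSketch()
new_path.add_pending("x > 0")
pending_before = list(new_path.pending)
new_path.activate()
```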

0xkarmacoma · 2023-11-28T00:54:13Z

src/halmos/sevm.py

@@ -585,6 +585,53 @@ def __next__(self) -> Instruction:
        return insn


+class Path:


nit: would you say that an instance of this class does represent a single path through the program? My instinct says no, because it makes it a little weird to initialize an SEVM with a Path object (since I don't really have a path before we get started).

Semantically, would something like PathTracker or something similar be more accurate?

fair point! but in my mental model, it's still a path. i added an explanation in a comment for now. we can discuss more to improve the naming.

0xkarmacoma

Very clean! See minor comments and requests for clarifications

daejunpark added 24 commits November 1, 2023 15:28

perf: share solver for all paths

af4a9d1

perf: do not call solver if simplify is enough

23ca9b2

perf: path condition slicing

54aed85

perf: default branching timeout to 1s

b3522ce

tmp debugging

c780130

perf: global dfs

3fc2f59

wip

8e5562e

wip

3d8dac5

move halmos logs to sevm

6a152c8

move steps to sevm

23a2b89

early exit

815adbf

fix early exit

fb2d96e

cleanup for minimal changes to highlight main idea

2d9fd35

wip: increase timeout for resolving addresses

90114a9

cleanup

8d110ce

cleanup

738a3db

cleanup: yield from sevm.run

7ae9a8a

cleanup: remove gen_model

5625d01

cleanup: setup iterator

7cb5601

cleanup: dump smt queries

d953a20

tmp: disable farcaster test

dc1151e

test: remove test-parallel from longer tests

d464a07

add --solver-threads option

c7b1765

fix lint

251390f

daejunpark force-pushed the perf/solver-reuse branch from 4ca5985 to 251390f Compare November 26, 2023 03:22

daejunpark marked this pull request as ready for review November 26, 2023 03:27

0xkarmacoma self-requested a review November 27, 2023 23:58

0xkarmacoma reviewed Nov 28, 2023

0xkarmacoma approved these changes Nov 28, 2023

address review comments

06743c1

daejunpark force-pushed the perf/solver-reuse branch from e451ba5 to 7461366 Compare November 29, 2023 02:13

daejunpark added 2 commits November 28, 2023 18:32

use pop(N) instead of looping

b306103

use new solver for checking validity of setup execs

b8efcee

daejunpark force-pushed the perf/solver-reuse branch from 7461366 to b8efcee Compare November 29, 2023 03:04

daejunpark merged commit 380314a into main Nov 29, 2023
61 checks passed

daejunpark deleted the perf/solver-reuse branch November 29, 2023 03:31

daejunpark mentioned this pull request Nov 29, 2023

fix: decoding storage mapping with bytes key #221

Merged

daejunpark mentioned this pull request Dec 20, 2023

fix: trace generation for shared solver #231

Merged
