Optimize qubit hash for Set operations #6908
base: main
Conversation
Improves amortized `Set` operations perf by around 50%, though with the caveat that sets containing qudits of different dimensions but the same grid position will always have the same key (not just the same bucket), and thus have to fall back to `__eq__`, causing degenerate perf in that case. It seems unlikely that anyone would intentionally do this, though.

```python
s = set()
for q in cirq.GridQubit.square(100):
    s = s.union({q})
```
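To make the caveat concrete, here is a hedged illustration; the hash collision only occurs under the position-only hash proposed at this point in the PR, not under the released tuple hash:

```python
import cirq

# Two qudits at the same grid position but with different dimensions.
q2 = cirq.GridQid(0, 0, dimension=2)
q3 = cirq.GridQid(0, 0, dimension=3)
assert q2 != q3
# Under a hash that ignores dimension, hash(q2) == hash(q3), so any set or dict
# holding both must fall back to __eq__ on every lookup at that key.
```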
Codecov Report: All modified and coverable lines are covered by tests ✅

Additional details and impacted files:

@@           Coverage Diff           @@
##             main    #6908   +/-   ##
=======================================
  Coverage   97.87%   97.87%
=======================================
  Files        1084     1084
  Lines       94406    94408      +2
=======================================
+ Hits        92396    92398      +2
  Misses       2010     2010

☔ View full report in Codecov by Sentry.
cirq-core/cirq/devices/grid_qubit.py (Outdated)
# This approach seems to perform better than traditional "random" hash in `Set`
# operations for typical circuits, as it reduces bucket collisions. Caveat: it does not
How did you evaluate this reduction in bucket collisions? Would be good to show this explicitly before we decide to abandon the standard tuple hash.
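One way to check the collision claim empirically is to count how many items share a bucket index. A hedged sketch, not taken from the PR: CPython sets pick a bucket roughly as `hash(x) & (table_size - 1)` for a power-of-two table size, so counting duplicate masked hashes approximates collisions (open-addressing probing is ignored, and `count_bucket_collisions` is a hypothetical helper name).

```python
from collections import Counter

import cirq


def count_bucket_collisions(items, table_size: int = 2**14) -> int:
    # Approximate bucket collisions by counting items whose masked hash coincides.
    buckets = Counter(hash(x) & (table_size - 1) for x in items)
    return sum(n - 1 for n in buckets.values() if n > 1)


# Compare this count for a 100x100 grid before and after changing __hash__.
print(count_bucket_collisions(cirq.GridQubit.square(100)))
```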
Test code is up in the description. It's about 50% faster with this implementation.
One note: it only seems to be faster for copy-on-change ops like `s = s.union({q})`. It doesn't seem to have any effect when we operate on sets mutably, like `s |= {q}`. But given most of our stuff is immutable, we see a lot more of the former in our codebase.
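A minimal benchmark sketch of the two forms described above, assuming `cirq` is installed (timings vary by machine, and the helper names are made up for illustration):

```python
import timeit

import cirq

qubits = cirq.GridQubit.square(100)


def copy_on_change():
    # Each union allocates a new set and re-inserts every existing element
    # into a fresh hash table, so bucket placement is redone repeatedly.
    s = set()
    for q in qubits:
        s = s.union({q})
    return s


def in_place():
    # |= updates the existing set, so prior elements keep their buckets.
    s = set()
    for q in qubits:
        s |= {q}
    return s


print("copy-on-change:", timeit.timeit(copy_on_change, number=1))
print("in-place:      ", timeit.timeit(in_place, number=1))
```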
cirq-core/cirq/devices/grid_qubit.py (Outdated)
square_index = max(abs_row, abs_col)
inner_square_side_len = square_index * 2 - 1
outer_square_side_len = inner_square_side_len + 2
inner_square_area = inner_square_side_len**2
if abs_row == square_index:
    offset = 0 if row < 0 else outer_square_side_len
    i = inner_square_area + offset + (col + square_index)
else:
    offset = (2 * outer_square_side_len) + (0 if col < 0 else inner_square_side_len)
    i = inner_square_area + offset + (row + (square_index - 1))
self._hash = hash(i)
It looks like this is almost 3x slower than the current tuple hash, which is quite a big regression, so unless we can really show that this reduces hash collisions, I'm not sure we would want to make this change.
In [1]: def tuple_hash(row, col, d):
...: return hash((row, col, d))
...:
In [2]: def square_hash(row, col, d):
...: if row == 0 and col == 0:
...: return 0
...: abs_row = abs(row)
...: abs_col = abs(col)
...: square_index = max(abs_row, abs_col)
...: inner_square_side_len = square_index * 2 - 1
...: outer_square_side_len = inner_square_side_len + 2
...: inner_square_area = inner_square_side_len**2
...: if abs_row == square_index:
...: offset = 0 if row < 0 else outer_square_side_len
...: i = inner_square_area + offset + (col + square_index)
...: else:
...: offset = (2 * outer_square_side_len) + (0 if col < 0 else inner_square_side_len)
...: i = inner_square_area + offset + (row + (square_index - 1))
...: return hash(i)
...:
In [3]: %timeit [tuple_hash(r, c, d) for r in range(20) for c in range(20) for d in [2, 3, 4]]
151 µs ± 427 ns per loop (mean ± std. dev. of 7 runs, 10,000 loops each)
In [4]: %timeit [square_hash(r, c, d) for r in range(20) for c in range(20) for d in [2, 3, 4]]
437 µs ± 2.37 µs per loop (mean ± std. dev. of 7 runs, 1,000 loops each)
I'm not married to it. It was something I noticed when looking into creating very wide circuits and got nerd sniped. It's a reasonable optimization for copy-on-change operations on large sets. But if we want to stick to the existing approach, I'd say it's completely justifiable.
Instead of the fancy plane-covering algorithm, I realized we could just hash the complex number `row + col * 1j`. This ends up being about 2.5x faster than the fancy plane-covering hash, but still 30% slower than the tuple hash, to hash a million distinct GridQubits, yet still 50% faster than the tuple hash to do set unions on a 100x100 GridQubit square.

Then, looking up the actual algorithm for hashing complex numbers, it's just `real_part + imag_part * sys.hash_info.imag`. Switching to that directly, it's now about 30% faster than the tuple hash to hash a million distinct GridQubits, and still 50% faster to do set unions on a 100x100 GridQubit square. Plus, it looks like... a normal hash function. (I feel kind of silly now for not trying this first.)

So the code is vastly simplified, and it's faster for all "normal" cases now, but the caveat still applies about it being slow on sets that have multiple qudits of different dimensions at the same grid position.
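A minimal sketch of the complex-style hash described above (the function name is hypothetical; the real change lives in `grid_qubit.py`), mirroring how CPython combines the real and imaginary parts of a complex number:

```python
import sys


def grid_position_hash(row: int, col: int) -> int:
    # CPython hashes a complex z roughly as hash(z.real) + sys.hash_info.imag * hash(z.imag).
    # Applying the same linear combination to (row, col) keeps the hash linear in each
    # coordinate, which spreads nearby grid positions across distinct buckets.
    return hash(row + col * sys.hash_info.imag)
```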
And, finally coming to my senses, I included the `dimension` term in the hash, which slows it back down to exactly the tuple hash speed, but it is still 50% faster on set unions. Now it is a more standard hash function, covering all attributes.

I'm going to mark the PR as ready again; at this point it seems like a pretty straightforward improvement with no downside.
Change the hash function from the tuple hash to manually multiplying each term by `1_000_003`, which is also the multiplier Python uses internally for strings and complex numbers. This hashes at the same speed as the tuple, but maintains a linear relationship with each term, which reduces the number of bucket collisions in the hash tables underlying Sets and Dicts for line and grid qubits. Improves amortized `Set` operations perf, such as the set-union example in the description above, by around 50%.

Fixes #6886
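One plausible shape of that hash, as a hedged sketch rather than the exact merged diff (the function name is made up, and the term ordering may differ in the real code):

```python
_MULTIPLIER = 1_000_003  # the same constant as sys.hash_info.imag


def grid_qid_hash_sketch(row: int, col: int, dimension: int) -> int:
    # Each attribute is folded in with a power of the multiplier, so the result
    # stays linear in row, col, and dimension while still covering all attributes.
    return hash(dimension + _MULTIPLIER * (col + _MULTIPLIER * row))
```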