Heap profiling for memory bottlenecks discovery #113
I think @ed255 actually did this and saw the place where memory spikes the most (when we hold the Evaluation & Coefficient forms of all our polynomials). And IIRC, I saw that we can't really avoid that (although we can revisit, of course). @ed255 Can you maybe post a summary here so that we can trigger a discussion on how/whether it is worth solving this?
I used dhat to profile memory usage and make sure that the frontend-backend split didn't increase the memory consumption compared to the legacy halo2. My impressions of the profiler were:
I didn't analyze the profiling results to see which part of halo2 was using the most memory; I was only paying attention to the comparison between the frontend-backend split and legacy halo2. Nevertheless, on a different occasion I tried to reason about the biggest source of memory usage in halo2, and here are my conclusions.
Constants:
Once the circuit is big enough, this is the biggest source of memory usage:
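As a rough illustration of why holding both the coefficient form and the extended-domain evaluations of all polynomials dominates, here is a back-of-the-envelope sketch. All parameters below (2^k rows, 32-byte field elements, an 8x extended domain, 100 polynomials) are assumptions chosen for the example, not numbers taken from the profiling runs:

```rust
// Hypothetical estimate of holding all polynomials in coefficient form
// plus their evaluations on the extended domain at the same time.
fn estimated_poly_bytes(k: u32, extended_k: u32, n_polys: u64) -> u64 {
    let field_elem_bytes = 32u64; // assume a 256-bit field element
    let coeff_form = n_polys * (1u64 << k) * field_elem_bytes;
    let extended_evals = n_polys * (1u64 << (k + extended_k)) * field_elem_bytes;
    // Holding both forms at once is what drives the peak.
    coeff_form + extended_evals
}

fn main() {
    // e.g. 2^20 rows, extended domain 2^3 = 8x larger, 100 polynomials
    let bytes = estimated_poly_bytes(20, 3, 100);
    println!("~{:.1} GiB", bytes as f64 / (1u64 << 30) as f64); // ~28.1 GiB
}
```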
In a discussion with @han0110, he mentioned that Scroll did a trick to cap the max memory usage by "computing the evaluations on extended domain and quotient chunk by chunk", but I don't fully understand this idea 😅
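My best guess at what "chunk by chunk" could look like, as a minimal self-contained sketch (all helpers and types below are hypothetical placeholders, not halo2 or Scroll code; field elements are faked as `u64`): instead of materializing every polynomial on the whole extended domain at once, each chunk of the extended domain is evaluated, folded into the quotient, and dropped before the next chunk.

```rust
type Fe = u64; // toy stand-in for a field element

// Hypothetical: evaluate one coefficient-form polynomial on a single chunk of the
// extended domain (a real implementation would use coset FFTs per chunk).
fn evaluate_on_chunk(poly: &[Fe], chunk_idx: usize, chunk_len: usize) -> Vec<Fe> {
    (0..chunk_len)
        .map(|i| {
            let x = (chunk_idx * chunk_len + i) as Fe; // fake evaluation point
            // Horner evaluation with wrapping arithmetic standing in for field ops.
            poly.iter().rev().fold(0, |acc, &c| acc.wrapping_mul(x).wrapping_add(c))
        })
        .collect()
}

// Hypothetical: combine the per-polynomial evaluations of one chunk into the
// quotient values for that chunk (a plain sum stands in for the real gates).
fn combine_constraints(evals: &[Vec<Fe>]) -> Vec<Fe> {
    (0..evals[0].len())
        .map(|i| evals.iter().map(|e| e[i]).fold(0, Fe::wrapping_add))
        .collect()
}

fn quotient_chunk_by_chunk(coeff_polys: &[Vec<Fe>], num_chunks: usize, chunk_len: usize) -> Vec<Fe> {
    let mut quotient = Vec::with_capacity(num_chunks * chunk_len);
    for chunk_idx in 0..num_chunks {
        // Only this chunk's extended-domain evaluations are alive at any time...
        let evals: Vec<Vec<Fe>> = coeff_polys
            .iter()
            .map(|p| evaluate_on_chunk(p, chunk_idx, chunk_len))
            .collect();
        quotient.extend(combine_constraints(&evals));
        // ...and they are dropped here, so the peak is roughly the coefficient
        // forms plus one chunk of evaluations instead of the whole extended domain.
    }
    quotient
}

fn main() {
    // Two toy "polynomials"; an extended domain of 8 points split into 4 chunks of 2.
    let polys = vec![vec![1, 2, 3], vec![4, 5, 6]];
    println!("{:?}", quotient_chunk_by_chunk(&polys, 4, 2));
}
```

If that reading is right, it would cap the evaluation memory at roughly 1/num_chunks of the full extended domain, at the cost of some repeated per-chunk work.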
Nice write-up! I believe the optimization Han mentioned is scroll-tech#28 (comment)
Working on porting this here! Trying to get good benchmarks to make available with the PR.
Superseded by #291
Use ad-hoc heap allocation profiling to see how memory is being consumed and how we can optimize it.
See the dhat docs (https://docs.rs/dhat/latest/dhat/) for more info.
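A minimal setup sketch following the pattern in the dhat docs; `create_proof_for_my_circuit` is a hypothetical stand-in for whatever halo2 workload we want to measure:

```rust
// Requires the dhat crate as a dependency (e.g. `dhat = "0.3"` in Cargo.toml).
// dhat replaces the global allocator and records every heap allocation,
// including the peak heap usage we care about for finding memory bottlenecks.
#[global_allocator]
static ALLOC: dhat::Alloc = dhat::Alloc;

fn main() {
    // Profiling runs while `_profiler` is alive; on drop it writes dhat-heap.json,
    // which can be inspected with dhat's viewer.
    let _profiler = dhat::Profiler::new_heap();

    create_proof_for_my_circuit();
}

// Hypothetical placeholder for the actual proving workload being profiled.
fn create_proof_for_my_circuit() {
    let _scratch: Vec<u8> = vec![0u8; 1 << 20];
}
```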