REF: Change how costs are reduced #327

carterbox · 2024-07-22T16:28:24Z

Purpose

Try to avoid host synchronization during cost function computation.

Approach

Store objective function values in an GPU array until the end of the epoch.

Pre-Merge Checklists

Submitter

Write a helpfully descriptive pull request title.
Organize changes into logically grouped commits with descriptive commit messages.
Document all new functions.
Click 'details' on the readthedocs check to view the updated docs.
Write tests for new functions or explain why they are not needed.
Address any complaints from pep8speaks.

Reviewer

Actually read all of the code.
Run the new code yourself; the included tests should make this easy.
Write a summary of the changes as you understand them.
Thank the submitter.

pep8speaks · 2024-07-22T16:28:29Z

Hello @carterbox! Thanks for updating this PR. We checked the lines you've touched for PEP 8 issues, and found:

In the file src/tike/ptycho/solvers/lstsq.py:

Line 291:81: E501 line too long (90 > 80 characters)
Line 321:81: E501 line too long (90 > 80 characters)

In the file src/tike/ptycho/solvers/rpie.py:

Line 105:81: E501 line too long (85 > 80 characters)
Line 185:81: E501 line too long (82 > 80 characters)

Comment last updated at 2024-07-29 15:19:00 UTC

a4894z · 2024-07-24T15:09:45Z

src/tike/ptycho/ptycho.py

I had to look up what itertools.chain() does; it looks like the point of this change is to not reduce (i.e. compute the mean) of the costs here in ptycho.py because we need these for momentum acceleration in rPIE or LSTSQ?

And by using itertools.chain() here, we keep track of the costs vs scan position until any momentum acceleration computation is completed and then we can take the mean in LSTSQ or rPIE?

We want to track a unique objective value for each GPU because of momentum acceleration. e.g. If you do a reconstruction with 4 GPUs for 100 epochs, the resulting costs matrix should be shaped (100, 4).

Locally (to each GPU), the convergence behavior can be different than the other GPUs, so we don't want to use the global mean cost to make decisions about momentum acceleration.

The FIXME exists because I think the momentum acceleration method is still using the global mean to make decisions about momentum acceleration. I should just fix that before merging this PR.

REF: Change how costs are reduced

d5a5859

carterbox requested a review from a4894z July 22, 2024 16:28

a4894z reviewed Jul 24, 2024

View reviewed changes

a4894z self-requested a review July 24, 2024 15:18

carterbox marked this pull request as draft July 24, 2024 15:56

carterbox and others added 2 commits July 24, 2024 17:24

BUG: Use local errors for checked momentum

8c1561a

Merge branch 'main' into costs-reduce

adc1693

carterbox marked this pull request as ready for review July 29, 2024 18:56

a4894z merged commit 519a331 into AdvancedPhotonSource:main Jul 31, 2024
7 checks passed

carterbox deleted the costs-reduce branch July 31, 2024 18:41

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

REF: Change how costs are reduced #327

REF: Change how costs are reduced #327

carterbox commented Jul 22, 2024

pep8speaks commented Jul 22, 2024 •

edited

Loading

a4894z Jul 24, 2024

carterbox Jul 24, 2024

REF: Change how costs are reduced #327

REF: Change how costs are reduced #327

Conversation

carterbox commented Jul 22, 2024

Purpose

Approach

Pre-Merge Checklists

Submitter

Reviewer

pep8speaks commented Jul 22, 2024 • edited Loading

Comment last updated at 2024-07-29 15:19:00 UTC

a4894z Jul 24, 2024

Choose a reason for hiding this comment

carterbox Jul 24, 2024

Choose a reason for hiding this comment

pep8speaks commented Jul 22, 2024 •

edited

Loading