-
Notifications
You must be signed in to change notification settings - Fork 79
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
add FFT=DUCC option to makefile #511
Merged
Changes from 15 commits
Commits
Show all changes
19 commits
Select commit
Hold shift + click to select a range
486ae6a
ducc makefile kinda working, still rebuilds lib for every example due…
ahbarnett 69fd51b
make FFT=DUCC working
ahbarnett 155499e
matlab lgomp, needed for DUCC FFT
ahbarnett 1cfeeb2
moved mat/oct flags R2008OO to always apply (was not used for OMP=OFF…
ahbarnett f0aaf71
make examples stop threadsafe tests when OMP=OFF
ahbarnett f8b356a
Merge branch 'master' into makeducc
ahbarnett 7ecc0c1
try to make ci for mac clang work and setup ducc
lu1and10 5ac746a
tidy changelog
ahbarnett 890ff44
cleaned up docs/install re CMake DUCC
ahbarnett 8988f1d
better gnu make docs, include all tasks/options, including DUCC
ahbarnett 1865324
Merge branch 'makeducc' of https://github.com/flatironinstitute/finuf…
ahbarnett f1875fa
let's make -j work with makefile first, need to clean up
lu1and10 cbed1a4
Merge branch 'makeducc' of https://github.com/flatironinstitute/finuf…
ahbarnett 11d8889
edit/rationalize docs/install.rst
ahbarnett f497e22
attempt to doc Joakim's pyproject.toml system
ahbarnett 32138cd
remove troublesome -e from make python; merge CMakeLists.txt
ahbarnett 5441880
update python install docs to remove -e
ahbarnett a8c3a8b
merge CHANGELOG and tidy its GPU list
ahbarnett f41ae29
cmake/ restore
ahbarnett File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -27,11 +27,11 @@ Developer notes | |
|
||
* The kernel function in spreadinterp is evaluated via piecewise-polynomial approximation (Horner's rule). The code for this is auto-generated in MATLAB, for all upsampling factors. There are two versions supported: | ||
|
||
- 2018--2024 vintage: no explicit SIMD vectorization, C code is generated code for the Horner evaluation loop, by running from MATLAB `gen_all_horner_C_code.m` | ||
- 2018--2024 vintage: no explicit SIMD vectorization, C code is generated code for the Horner evaluation loop, by running from MATLAB ``gen_all_horner_C_code.m`` | ||
|
||
- post-2024 vintage: explicit SIMD and many other acceleration tricks, and the generated code is a static C++ array of coefficients, and their sizes (`nc` or number of coefficients) for each width `w`. Run from MATLAB `gen_ker_horner_loop_cpp_code.m` | ||
- post-2024 vintage: explicit SIMD and many other acceleration tricks, and the generated code is a static C++ array of coefficients, and their sizes (``nc`` or number of coefficients) for each width ``w``. Run from MATLAB ``gen_ker_horner_loop_cpp_code.m`` | ||
|
||
See `devel/README` for more details. The ES kernel coefficient and poly approx degree for both of the above are defined in a single location, `devel/get_degree_and_beta.m`, which must match the C++ `setup_spreader()` function. | ||
See ``devel/README`` for more details. The ES kernel coefficient and poly approx degree for both of the above are defined in a single location, ``devel/get_degree_and_beta.m``, which must match the C++ ``setup_spreader()`` function. | ||
|
||
* Continuous Integration (CI). See files for this in ``.github/workflows/``. It currently tests the default ``makefile`` settings in linux, and three other ``make.inc.*`` files covering OSX and Windows (MinGW). CI does not test build the variant OMP=OFF. The dev should test these locally. Likewise, the Julia wrapper is separate and thus not tested in CI. We have added ``JenkinsFile`` for the GPU CI via python wrappers. | ||
|
||
|
@@ -49,7 +49,9 @@ Developer notes | |
|
||
* The cufinufft Python wheels are generated using Docker based on the manylinux2014 image. For instructions, see ``tools/cufinufft/distribution_helper.sh``. These are binary wheels that are built using CUDA 11 (or optionally CUDA 12, but these are not distributed on PyPI) and bundled with the necessary libraries. | ||
|
||
* Testing cufinufft (for FI, mostly) | ||
* CMake compiling on linux at Flatiron Institute (Rusty cluster): We have had a report that if you want to use LLVM, you need to ``module load llvm/16.0.3`` otherwise the default ``llvm/14.0.6`` does not find ``OpenMP_CXX``. | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. maybe this is outdated? @blackwer, could you give a review? |
||
|
||
* Testing cufinufft (for FI, mostly): | ||
|
||
.. code-block:: sh | ||
|
||
|
Oops, something went wrong.
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This should be not commented out. Please, merge master