Skip to content

Commit

Permalink
Merge remote-tracking branch 'flatiron/master' into 388-cmake-build-o…
Browse files Browse the repository at this point in the history
…nly-static-by-default
  • Loading branch information
DiamonDinoia committed Jul 24, 2024
2 parents 8592755 + 7f0a986 commit d10356e
Show file tree
Hide file tree
Showing 31 changed files with 2,244 additions and 1,796 deletions.
6 changes: 3 additions & 3 deletions .github/workflows/generate_cmake_matrix.py
Original file line number Diff line number Diff line change
Expand Up @@ -9,15 +9,15 @@
combinations = {
"ubuntu-22.04": {
"compiler": ["llvm", "gcc"],
"arch_flags": ["-march=native", "-march=x86-64"]
"arch_flags": ["-march=native", "-march=x86-64", "native"]
},
"windows-2022": {
"compiler": ["msvc", "llvm"],
"arch_flags": ["/arch:AVX2", "/arch:SSE2"]
"arch_flags": ["/arch:AVX2", "/arch:SSE2", "native"]
},
"macos-13": {
"compiler": ["llvm", "gcc-14"],
"arch_flags": ["-march=native", "-march=x86-64"]
"arch_flags": ["-march=native", "-march=x86-64", "native"]
}
}

Expand Down
16 changes: 15 additions & 1 deletion .pre-commit-config.yaml
Original file line number Diff line number Diff line change
@@ -1,5 +1,19 @@
repos:
- repo: https://github.com/pre-commit/mirrors-clang-format
rev: 'v18.1.6'
rev: 'v18.1.8'
hooks:
- id: clang-format
types_or: [c++, c, cuda]
- repo: https://github.com/pre-commit/pre-commit-hooks
rev: v2.3.0
hooks:
- id: check-yaml
- id: end-of-file-fixer
- id: trailing-whitespace
- repo: https://github.com/cheshirekow/cmake-format-precommit
rev: v0.6.13
hooks:
- id: cmake-format
additional_dependencies: [pyyaml]
types: [file]
files: (\.cmake|CMakeLists.txt)(.in)?$
4 changes: 3 additions & 1 deletion CHANGELOG
Original file line number Diff line number Diff line change
@@ -1,8 +1,10 @@
List of features / changes made / release notes, in reverse chronological order.
If not stated, FINUFFT is assumed (cuFINUFFT <=1.3 is listed separately).

V 2.3.0beta (6/21/24)
V 2.3.0beta (7/21/24)

* ES kernel rescaled to max value 1, reduced horner degrees for upsampfac=1.25
(fixes fp32 overflow issue #454).
* Major acceleration of spread/interp kernels using XSIMD header-only lib,
kernel evaluation, templating by ns with AVX-width-dependent decisions.
Up to 80% faster, dep on compiler. (Marco Barbone with help from Libin Lu).
Expand Down
Loading

0 comments on commit d10356e

Please sign in to comment.