Make neighbor list compatible with float16 and bfloat16 #273

RaulPPelaez · 2024-02-09T12:39:16Z

This PR adds overloads for float16 and bfloat16 input/output of the neighbor extension.

Makes it possible to train in float16 or bfloat16 without casting in between.

guillemsimeon · 2024-02-09T12:50:30Z

do you want me to review?

RaulPPelaez · 2024-02-09T12:51:13Z

no, I will let you know. Thanks

RaulPPelaez · 2024-02-12T09:03:41Z

I cannot enable the neighbor tests for float16. Some test always fails because with the precision being so low some there is a disagreement between the reference in the number of neighbors.

I am not sure exactly why. My guess is that, given that the references are being computed with numpy, they are actually being computed in a higher precision, while the other implementations use float16 all the way.

The tests run with random clouds of points, so I can see things being too close to the cutoff distance for this to be a problem and
float16(r)<=float16(rcut) != float64(r)<=float64(rcut)

This begs the question of whether it makes sense to ask for the neighbors in such a low precision....

RaulPPelaez · 2024-02-12T09:06:55Z

The majority of failing tests yield more neighbor pairs than the reference:

FAILED test_neighbors.py::test_neighbors[dtype0-None-True-True-4.9-128-cuda-brute] - AssertionError: Found num_pairs(118907) > max_num_pairs(118873)
FAILED test_neighbors.py::test_neighbors[dtype0-None-True-True-4.9-128-cuda-shared] - AssertionError: Found num_pairs(118907) > max_num_pairs(118873)
FAILED test_neighbors.py::test_neighbors[dtype0-None-True-False-4.9-128-cuda-brute] - AssertionError: Found num_pairs(112430) > max_num_pairs(112396)
FAILED test_neighbors.py::test_neighbors[dtype0-None-True-False-4.9-128-cuda-shared] - AssertionError: Found num_pairs(112430) > max_num_pairs(112396)
FAILED test_neighbors.py::test_neighbors[dtype0-None-False-True-4.9-128-cuda-brute] - AssertionError: Found num_pairs(62692) > max_num_pairs(62675)
FAILED test_neighbors.py::test_neighbors[dtype0-None-False-True-4.9-128-cuda-shared] - AssertionError: Found num_pairs(62692) > max_num_pairs(62675)
FAILED test_neighbors.py::test_neighbors[dtype0-None-False-False-4.9-128-cuda-brute] - AssertionError: Found num_pairs(56215) > max_num_pairs(56198)
FAILED test_neighbors.py::test_neighbors[dtype0-None-False-False-4.9-128-cuda-shared] - AssertionError: Found num_pairs(56215) > max_num_pairs(56198)
FAILED test_neighbors.py::test_neighbors[dtype0-triclinic-True-True-3.0-128-cuda-brute] - AssertionError: assert (2, 54285) == (2, 54287)
FAILED test_neighbors.py::test_neighbors[dtype0-triclinic-True-True-3.0-128-cuda-shared] - AssertionError: assert (2, 54285) == (2, 54287)
FAILED test_neighbors.py::test_neighbors[dtype0-triclinic-True-True-4.9-1-cuda-brute] - AssertionError: Found num_pairs(4414) > max_num_pairs(4412)
FAILED test_neighbors.py::test_neighbors[dtype0-triclinic-True-True-4.9-1-cuda-shared] - AssertionError: Found num_pairs(4414) > max_num_pairs(4412)
FAILED test_neighbors.py::test_neighbors[dtype0-triclinic-True-True-4.9-2-cuda-brute] - AssertionError: Found num_pairs(4887) > max_num_pairs(4885)
FAILED test_neighbors.py::test_neighbors[dtype0-triclinic-True-True-4.9-2-cuda-shared] - AssertionError: Found num_pairs(4887) > max_num_pairs(4885)
FAILED test_neighbors.py::test_neighbors[dtype0-triclinic-True-True-4.9-3-cuda-brute] - AssertionError: Found num_pairs(5089) > max_num_pairs(5087)

peastman · 2024-02-12T16:15:35Z

Building neighbor lists in low precision requires some care. You need to make sure the calculated distance always gets rounded down, never up, so it errs on the side of including extra pairs rather than omitting pairs. Then you need to make sure any code that uses the neighbor list can tolerate extra pairs that are beyond the cutoff.

Make neighbor list compatible with float16 and bfloat16

f38d673

RaulPPelaez added 3 commits February 9, 2024 16:55

Replace frobenius_norm by norm

4434dd9

Remove old sqrt overloads

72272f5

format

5eef965

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make neighbor list compatible with float16 and bfloat16 #273

Make neighbor list compatible with float16 and bfloat16 #273

RaulPPelaez commented Feb 9, 2024

guillemsimeon commented Feb 9, 2024

RaulPPelaez commented Feb 9, 2024

RaulPPelaez commented Feb 12, 2024

RaulPPelaez commented Feb 12, 2024

peastman commented Feb 12, 2024

Make neighbor list compatible with float16 and bfloat16 #273

Are you sure you want to change the base?

Make neighbor list compatible with float16 and bfloat16 #273

Conversation

RaulPPelaez commented Feb 9, 2024

guillemsimeon commented Feb 9, 2024

RaulPPelaez commented Feb 9, 2024

RaulPPelaez commented Feb 12, 2024

RaulPPelaez commented Feb 12, 2024

peastman commented Feb 12, 2024