fix split_by_feature bug #289

billbrod · 2025-01-10T21:29:17Z

Small bugfix: in split_by_feature for the additive basis, jax tree map was sorting the dictionary that was being used to be alphabetical with respect to the labels of basis1 and basis2. Thus, if those two basis objects had different n_basis_input values and were passed in using a different order than their alphabetical sorting, split_by_feature would fail. This fixes that by using an OrderedDict

It doesn't look like this was an issue for the MultiplicativeBasis, but I added that test anyway, let me know if you want me to remove it.

codecov-commenter · 2025-01-10T21:48:26Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 97.30%. Comparing base (a510ef3) to head (588ef3d).
Report is 91 commits behind head on development.

Additional details and impacted files

@@               Coverage Diff               @@
##           development     #289      +/-   ##
===============================================
+ Coverage        96.13%   97.30%   +1.16%     
===============================================
  Files               34       35       +1     
  Lines             2642     2779     +137     
===============================================
+ Hits              2540     2704     +164     
+ Misses             102       75      -27

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

BalzaniEdoardo

Great catch, if you add a couple of different input shapes in the test, this is good to go for me!

BalzaniEdoardo · 2025-01-10T22:46:00Z

tests/test_basis.py

@@ -304,6 +304,20 @@ def test_expected_output_split_by_feature(basis_instance, super_class):
        np.testing.assert_array_equal(xx[~nans], x[~nans])


+@pytest.mark.parametrize("composite_op", ["add", "multiply"])


This looks good, can you parametrize a few input shapes? Like (n,) (n,1) (n, 2), (n,1,2)

currently failing, need to modify behavior

BalzaniEdoardo · 2025-01-13T22:59:19Z

Changed in this PR:

split_by_feature: the method splits the feature axis using the shape of the provided input. Example, if a basis with 5 elements processed an input of shape (n_samples, 1, 2 ,3), the feature axis, which will be of length 1*2*3*5 will be reshaped to (1, 2, 3, 5).
__iter__: method for iterating over the additive components, which usually corresponds to different task variables.
__len__: returns the number of components.

…l use)

billbrod

Can you write out the attribute renaming that you did? Otherwise this looks good to me!

src/nemos/basis/_basis.py

billbrod · 2025-01-14T15:56:02Z

To be explicit: this PR changes the behavior of split_by_feature. Previously, it would always return a 3d array.

Now, basis.input_shape is set when compute_features or set_input_shape is called on a Nd array X, storing X.shape[1:] (it can thus be empty, if X is 1d). Then, basis.split_by_feature(X) returns an array of shape (n_samples, *basis.input_shape, basis.n_basis_funcs).

Some examples:

n_samples = 100
basis = nmo.basis.RaisedCosineLinearEval(7)
input_shape = (4, 5, 10) # or some other tuple of ints
X = np.random.rand(n_samples, *input_shape)
split = basis.split_by_feature(b.compute_features(X))
split[basis.label].shape
>>> (100, 4, 5, 10, 7)

n_samples = 100
basis = nmo.basis.RaisedCosineLinearEval(7)
input_shape = () # or some other tuple of ints
X = np.random.rand(n_samples, *input_shape)
split = basis.split_by_feature(b.compute_features(X))
split[basis.label].shape
>>> (100, 7)

Co-authored-by: William F. Broderick <[email protected]>

BalzaniEdoardo · 2025-01-14T20:26:11Z

Additional changes to _basis.py:

Renamed _n_basis_input_ to '_input_shape_product`, a more descriptive name.
Removed the propertyn_basis_input_ since the attribute is used internally only.
New public property input_shape which returns the input.shape[1:] for atomic bases and multiplicative bases, and a list of all input shapes for additive bases.

fix split_by_feature bug

b7e93d7

billbrod requested a review from BalzaniEdoardo as a code owner January 10, 2025 21:29

fix failing isort

c56bbca

BalzaniEdoardo requested changes Jan 10, 2025

View reviewed changes

billbrod and others added 7 commits January 13, 2025 11:00

adds tests

3a5cdc0

currently failing, need to modify behavior

changed params names and clone behavior

95c3a14

fixed tests

1f169e2

linted

36e7a76

fix doctests

76cb9bd

fix doctests

024f03a

added tests for iter and len

ccdd8a5

BalzaniEdoardo added 4 commits January 13, 2025 18:27

removed public property n_basis_input_ (keep private attr for interna…

4e192ba

…l use)

removed public property n_basis_input_ (keep private attr for interna…

90c9b9b

…l use)

linted

a8a71a7

fix comment

1f3aa0b

billbrod mentioned this pull request Jan 14, 2025

Should basis have __getitem__? #290

Open

billbrod commented Jan 14, 2025

View reviewed changes

src/nemos/basis/_basis.py Outdated Show resolved Hide resolved

src/nemos/basis/_basis.py Show resolved Hide resolved

BalzaniEdoardo and others added 4 commits January 14, 2025 15:13

Update src/nemos/basis/_basis.py

5182d67

Co-authored-by: William F. Broderick <[email protected]>

Update src/nemos/basis/_basis.py

4d0d4a7

Co-authored-by: William F. Broderick <[email protected]>

changed docstrings

58b2d1f

changed var name

54c4c60

Merge branch 'development' into composite_ordering

588ef3d

BalzaniEdoardo merged commit f5d3fde into development Jan 14, 2025
13 checks passed

BalzaniEdoardo deleted the composite_ordering branch January 14, 2025 22:30

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix split_by_feature bug #289

fix split_by_feature bug #289

billbrod commented Jan 10, 2025

codecov-commenter commented Jan 10, 2025 •

edited

Loading

BalzaniEdoardo left a comment

BalzaniEdoardo Jan 10, 2025

BalzaniEdoardo commented Jan 13, 2025

billbrod left a comment

billbrod commented Jan 14, 2025 •

edited

Loading

BalzaniEdoardo commented Jan 14, 2025 •

edited

Loading

		@@ -304,6 +304,20 @@ def test_expected_output_split_by_feature(basis_instance, super_class):
		np.testing.assert_array_equal(xx[~nans], x[~nans])


		@pytest.mark.parametrize("composite_op", ["add", "multiply"])

fix split_by_feature bug #289

fix split_by_feature bug #289

Conversation

billbrod commented Jan 10, 2025

codecov-commenter commented Jan 10, 2025 • edited Loading

Codecov Report

BalzaniEdoardo left a comment

Choose a reason for hiding this comment

BalzaniEdoardo Jan 10, 2025

Choose a reason for hiding this comment

BalzaniEdoardo commented Jan 13, 2025

Changed in this PR:

billbrod left a comment

Choose a reason for hiding this comment

billbrod commented Jan 14, 2025 • edited Loading

BalzaniEdoardo commented Jan 14, 2025 • edited Loading

codecov-commenter commented Jan 10, 2025 •

edited

Loading

billbrod commented Jan 14, 2025 •

edited

Loading

BalzaniEdoardo commented Jan 14, 2025 •

edited

Loading