
sasrec tests #193

Open
wants to merge 7 commits into base: experimental/sasrec

Conversation

spirinamayya (Contributor):

added sasrec test

raise NotImplementedError()


class IdEmbeddingsItemNet(ItemNetBase):
"""
Base class for item embeddings. To use more complicated logic than just id embeddings, inherit
from this class and pass your custom ItemNet to your model params.
Network for item embeddings.

Suggested change
Network for item embeddings.
Network for item embeddings based only on item ids.

output = self.ff_relu(self.ff_dropout1(self.ff_linear1(seqs)))
fin = self.ff_dropout2(self.ff_linear2(output))
return fin


class SASRecTransformerLayers(TransformerLayersBase):
"""Exactly SASRec authors architecture but with torch MHA realisation"""
"""
Exactly SASRec author's transformer blocks architecture but with torch MHA realisation.

Suggested change
Exactly SASRec author's transformer blocks architecture but with torch MHA realisation.
Exactly SASRec author's transformer blocks architecture but with pytorch Multi-Head Attention realisation.

Parameters
----------
n_blocks: int
Number of self-attention blocks.

Suggested change
Number of self-attention blocks.
Number of transformer blocks.


use_causal_attn: bool, default True
If ``True``, causal mask is used in multi-head self-attention.
transformer_layers_type: Type(TransformerLayersBase), default `SasRecTransformerLayers`
Type of transformer layers used for training.

Suggested change
Type of transformer layers used for training.
Type of transformer layers architecture.

Parameters
----------
sessions: torch.Tensor
User sessions consisting of items.

Suggested change
User sessions consisting of items.
User sessions in the form of sequences of items ids.


Returns
-------
torch.Tensor
User sessions with positional encoding if use_pos_emb is ``True``.

Suggested change
User sessions with positional encoding if use_pos_emb is ``True``.
Encoded user sessions with added positional encoding if `use_pos_emb` is ``True``.

Parameters
----------
sessions: torch.Tensor
User sessions consisting of items.

Suggested change
User sessions consisting of items.
User sessions in the form of sequences of items ids.

Parameters
----------
sessions: List[List[int]]
User interaction sequences.

Suggested change
User interaction sequences.
User sessions in the form of sequences of items ids.

model.fit(dataset=dataset)
users = np.array([10, 30, 40])
actual = model.recommend(users=users, dataset=dataset, k=3, filter_viewed=filter_viewed)
actual[Columns.Item] = actual[Columns.Item].apply(int)
Collaborator:

This is bad. If items in the dataset were int, we need to receive int ids from the recommend method. If this doesn't work, we need to find where exactly we are breaking them and fix it. Is it in IdMap, when creating a new id map with the string ["PAD"] and then adding the other ids as int?
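As a hedged illustration of the suspected breakage (not the actual IdMap internals): once a string "PAD" placeholder shares a container with int ids, pandas falls back to object dtype, and the ids stay object even after the padding entry is dropped.

```python
import pandas as pd

# Hypothetical illustration: mixing a string "PAD" placeholder with int ids
# forces pandas to the object dtype for the whole series.
ids_with_pad = pd.Series(["PAD", 10, 20, 30])
print(ids_with_pad.dtype)  # object

# Dropping the padding does NOT restore the int dtype:
real_ids = ids_with_pad.iloc[1:]
print(real_ids.dtype)  # still object, not int64
```

If this is where the ids degrade, the fix belongs in the id-mapping code rather than in the tests.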

Collaborator:

Let's not apply(int) or astype(int), because they can modify float values. Pass check_dtype=False to assert_frame_equal or assert_series_equal instead.
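A minimal sketch of the suggested test change, assuming the reco item column comes back with object dtype while the expected frame holds int64:

```python
import pandas as pd
from pandas.testing import assert_frame_equal

expected = pd.DataFrame({"item_id": [1, 2, 3]})  # int64 column
actual = pd.DataFrame({"item_id": pd.Series([1, 2, 3], dtype=object)})

# Strict comparison would fail on dtype alone; check_dtype=False compares
# only the values, so no apply(int)/astype(int) is needed in the test.
assert_frame_equal(actual, expected, check_dtype=False)
```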

Collaborator:

So we still expect incorrect id types in reco? Could you explain the reason?

Collaborator:

We receive int item ids, but the pd.Series has dtype object. So the values are correct and they are not float; it is only the dtype that is not equal to int.

Collaborator:

Let's convert them back to int in the code (I mean in the main code, not test)

If I'm a user and I give ints, I expect ints in the reco. Besides, object dtype is much slower for future iterations. There may also be problems with metric calculation, since pandas will fail trying to merge int and object columns.
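A possible shape for that fix (hypothetical, not the actual model code): remember the dtype of the external item ids and cast the reco column back to it before returning, so int input yields int output.

```python
import pandas as pd

# Hypothetical sketch of the proposed fix in the main code: keep the dtype of
# the external item ids and cast the recommendation column back to it.
external_item_ids = pd.Series([10, 20, 30])         # user supplied int ids
reco_items = pd.Series([30, 10, 20], dtype=object)  # ids degraded to object

reco_items = reco_items.astype(external_item_ids.dtype)
print(reco_items.dtype)  # int64
```

This restores the user-facing dtype without touching float ids, since the cast target is whatever dtype the user originally supplied.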

from tests.testing_utils import assert_id_map_equal, assert_interactions_set_equal


@pytest.mark.filterwarnings("ignore::pytorch_lightning.utilities.warnings.PossibleUserWarning")
Collaborator:

Could you please add comments for this and the next line? (What the warnings are and why we ignore them.)

model.fit(dataset=dataset)
users = np.array([10, 30, 40])
actual = model.recommend(users=users, dataset=dataset, k=3, filter_viewed=filter_viewed)
actual[Columns.Item] = actual[Columns.Item].apply(int)
Collaborator:

So we still expect incorrect id types in reco? Could you explain the reason?
