Removes separate VISSL caching and adds file_name to torch.hub.load_state_dict_from_url everywhere #179

jonasd4 · 2024-09-10T12:58:26Z

This PR fixes further issues with torch.hub.load_state_dict_from_url.

Every call to that function now sets the file_name argument, so that a cached file belonging to a different model is never reused by accident.
The additional caching of VISSL models has been removed; it was added under the assumption that torch.hub.load_state_dict_from_url performs no caching of its own.
The issue also affected loading the barlowtwins and vicreg models: their respective hubconf.py files (e.g. https://github.com/facebookresearch/barlowtwins/blob/main/hubconf.py) do not set file_name either, and the file name in both URLs was the same. These models are now loaded directly from their respective URLs, with file_name passed to the load function.
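
The collision described above can be illustrated with a minimal sketch of how a URL-keyed checkpoint cache picks its file name. It assumes the cache uses the last path component of the URL unless a file name is given, which is how torch.hub.load_state_dict_from_url behaves to the best of my knowledge; the helper `cached_checkpoint_name` and both URLs are hypothetical and exist only for illustration.

```python
import os
from typing import Optional
from urllib.parse import urlparse


def cached_checkpoint_name(url: str, file_name: Optional[str] = None) -> str:
    """Return the file name a URL-keyed checkpoint cache would store under.

    Hypothetical helper: it mimics the default of using the URL's basename
    and lets an explicit file_name override it.
    """
    url_basename = os.path.basename(urlparse(url).path)
    return file_name if file_name is not None else url_basename


# Hypothetical URLs: two different models whose checkpoint files share a basename.
barlowtwins_url = "https://example.com/barlowtwins/resnet50.pth"
vicreg_url = "https://example.com/vicreg/resnet50.pth"

# Without file_name, both models map to the same cache entry,
# so whichever was downloaded first is silently reused for the other.
default_a = cached_checkpoint_name(barlowtwins_url)
default_b = cached_checkpoint_name(vicreg_url)

# With a distinct file_name per model, the cache entries no longer collide.
unique_a = cached_checkpoint_name(barlowtwins_url, file_name="barlowtwins_resnet50.pth")
unique_b = cached_checkpoint_name(vicreg_url, file_name="vicreg_resnet50.pth")
```

Here `default_a` and `default_b` are both `resnet50.pth`, while `unique_a` and `unique_b` differ, which is the effect this PR relies on.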

codecov · 2024-09-10T13:17:10Z

Codecov Report

Attention: Patch coverage is 77.77778% with 2 lines in your changes missing coverage. Please review.

Project coverage is 76.26%. Comparing base (200e047) to head (c087a2a).
Report is 7 commits behind head on master.

Files with missing lines	Patch %	Lines
thingsvision/core/extraction/extractors.py	77.77%	0 Missing and 2 partials ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##           master     #179      +/-   ##
==========================================
- Coverage   76.30%   76.26%   -0.04%     
==========================================
  Files          40       40              
  Lines        2055     2056       +1     
  Branches      262      263       +1     
==========================================
  Hits         1568     1568              
  Misses        402      402              
- Partials       85       86       +1

Flag	Coverage Δ
unittests	`76.26% <77.77%> (-0.04%)`	⬇️

LukasMut · 2024-09-10T16:47:30Z

@jonasd4 Could you provide a meaningful description of the PR? It doesn't have to be long. Can be a one-liner.

lciernik

Docstring and filename extension clarification requested

lciernik · 2024-09-11T11:11:24Z

thingsvision/core/extraction/extractors.py

@@ -350,15 +350,14 @@ def __init__(
            device=device,
        )

-    def _download_and_save_model(self, model_url: str,
-                                 output_model_filepath: str, unique_model_id: str):
+    def _load_vissl_state_dict(self, model_url: str, unique_model_filename: str):
        """
        Downloads the model in vissl format, converts it to torchvision format and


Maybe adapt the docstring to state that load_state_dict_from_url uses a cached version if one is available.

I adapted the docstring, please have a look if that's clear now!

lciernik · 2024-09-11T11:13:40Z

thingsvision/core/extraction/extractors.py

@@ -394,25 +392,25 @@ def load_model_from_source(self) -> None:
        Otherwise, loads it from the cache directory.
        """
        if self.model_name in SSLExtractor.MODELS:
+
+            # unique model id name for all models
+            unique_model_filename = f'thingsvision_ssl_v0_{self.model_name}'


The unique_model_filename no longer has an extension, because one is set neither in _load_vissl_state_dict nor in load_state_dict_from_url. Is this the desired behavior?

Yes, it is better with an extension! Added it now.
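
For reference, a minimal sketch of what a unique file name with an extension could look like when passed to torch.hub.load_state_dict_from_url. The `thingsvision_ssl_v0_` prefix comes from the diff above; the `.pth` extension, the helper names, and the example model name are assumptions and not necessarily what the PR uses.

```python
def unique_checkpoint_filename(model_name: str, extension: str = ".pth") -> str:
    """Build a cache file name that is unique per model.

    The prefix is taken from the diff; the default extension is an assumption.
    """
    return f"thingsvision_ssl_v0_{model_name}{extension}"


def load_cached_state_dict(model_url: str, model_name: str):
    """Download the checkpoint, or reuse the cached copy if it already exists."""
    # Imported lazily so the naming helper above works without torch installed.
    import torch

    return torch.hub.load_state_dict_from_url(
        model_url,
        map_location="cpu",  # load on CPU regardless of where it was saved
        file_name=unique_checkpoint_filename(model_name),
    )


# Illustrative model name only.
example_filename = unique_checkpoint_filename("barlowtwins-rn50")
```

With this naming, a second call for the same model would find the file in the torch hub checkpoint directory and skip the download, while a different model name always resolves to a different file.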

LukasMut

@jonasd4 LGTM but please add a short description before I approve.

jonasd4 · 2024-09-11T12:54:57Z

@jonasd4 LGTM but please add a short description before I approve.

Added the description!

lciernik

LGTM

removed vissl caching and added filename everywhere in torch.hub.load…

28faaa1

…_state_dict_from_url

jonasd4 added 2 commits September 10, 2024 15:35

fixed barlowtwins/vicreg model issue

cd80049

added map_location=cpu

9f8a6ce

a1247418 approved these changes Sep 10, 2024

LukasMut requested review from a1247418, LukasMut and lciernik September 10, 2024 15:53

LukasMut assigned jonasd4 Sep 10, 2024

LukasMut added the labels bug (Something isn't working) and cleanup (Deprecate or refactor code) Sep 10, 2024

added map_location=cpu to other calls

06c092c

a1247418 approved these changes Sep 10, 2024

lciernik reviewed Sep 11, 2024

adapted docstring and added file name ending

c087a2a

LukasMut reviewed Sep 11, 2024

LukasMut requested a review from lciernik September 11, 2024 12:44

lciernik approved these changes Sep 11, 2024

LukasMut requested a review from a1247418 September 11, 2024 13:42

LukasMut added this pull request to the merge queue Sep 11, 2024

Merged via the queue into master with commit da9d717 Sep 11, 2024
4 of 5 checks passed

LukasMut approved these changes Sep 11, 2024

LukasMut deleted the fix/ssl-models-caching branch September 11, 2024 15:03
