
New types of datasets supported for Delmic HDF5 format #328

Open · wants to merge 1 commit into main

Conversation

@noemiebonnet (Contributor) commented Oct 28, 2024

New supported formats: intensity, hyperspectral, angle-resolved, E-k, time-resolved.

Progress of the PR

  • Read intensity
  • Read hyperspectral
  • Read angle-resolved
  • Read E-k
  • Read time-resolved
  • Read original_metadata
  • Read EELS
  • Write EELS
  • Update docstring (if appropriate)
  • Update user guide (if appropriate)
  • Add a changelog entry in the upcoming_changes folder (see upcoming_changes/README.rst)
  • Check the formatting of the changelog entry (and any user guide changes) in the docs/readthedocs.org:rosettasciio build of this PR (link in the GitHub checks)
  • Add tests
  • Increase test coverage for errors
  • Ready for review

def get_unit_prefix(number):
    """Return the SI prefix for the given number based on its magnitude."""
    if 1e-15 < np.abs(number) < 1e-12:
        prefix = "f"

Check warning

Code scanning / CodeQL

Variable defined multiple times (Warning, flagged 5 times)

This assignment to 'prefix' is unnecessary as it is redefined before this value is used.
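The warning arises because a chain of branches each assigns `prefix` before a later branch overwrites it. A hypothetical refactor (thresholds and table contents are illustrative, not the PR's actual code) replaces the chain with a single lookup and returns as soon as a range matches:

```python
import numpy as np

# Illustrative magnitude ranges; extend the table as needed.
_PREFIXES = [
    (1e-15, 1e-12, "f"),  # femto
    (1e-12, 1e-9, "p"),   # pico
    (1e-9, 1e-6, "n"),    # nano
    (1e-6, 1e-3, "u"),    # micro
    (1e-3, 1.0, "m"),     # milli
]

def get_unit_prefix(number):
    """Return the SI prefix for the given number based on its magnitude."""
    mag = np.abs(number)
    for low, high, prefix in _PREFIXES:
        if low < mag < high:
            return prefix
    return ""  # no prefix for magnitudes outside the table
```

Returning directly from the loop leaves no assignment that is overwritten before use, so the CodeQL warning cannot recur.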

# Ensure SE Image shape is compatible by taking the last two dimensions
if len(SE_Image.shape) >= 2:
    Y, X = SE_Image.shape[-2:]

Check notice

Code scanning / CodeQL

Unused local variable (Note)

Variable Y is not used.
Variable X is not used.

codecov bot commented Oct 28, 2024

Codecov Report

Attention: Patch coverage is 62.60163% with 138 lines in your changes missing coverage. Please review.

Project coverage is 87.02%. Comparing base (88923bf) to head (0a3e897).
Report is 31 commits behind head on main.

Files with missing lines Patch % Lines
rsciio/delmic/_api.py 62.60% 85 Missing and 53 partials ⚠️
Additional details and impacted files
@@            Coverage Diff             @@
##             main     #328      +/-   ##
==========================================
- Coverage   87.84%   87.02%   -0.83%     
==========================================
  Files          85       85              
  Lines       11180    11534     +354     
  Branches     2280     2378      +98     
==========================================
+ Hits         9821    10037     +216     
- Misses        860      945      +85     
- Partials      499      552      +53     

☔ View full report in Codecov by Sentry.

@noemiebonnet (Contributor, Author) commented:

pre-commit.ci autofix

@jlaehne (Member) left a comment:

Thanks @noemiebonnet for the substantial progress! I left a few comments from a first pass over the code, but did not yet have time for a closer look. Please include the standard tracking list in the initial comment to give an overview of how far along the PR is.

There are still a number of lines uncovered by tests, as reported by codecov. If lines should be explicitly ignored in the coverage check, one can add the comment # pragma: no cover at the end of the line. I see that most uncovered lines concern warnings or errors that might surface during file reading. It might be hard to test for those if one only has well-formed test files. What is our current best practice in that case, @ericpre? (Note that you can also take inspiration from other file readers implemented recently, such as Horiba JobinYvon or Hamamatsu.)

doc/user_guide/supported_formats/delmic.rst (comment resolved)
doc/user_guide/supported_formats/delmic.rst (comment resolved)
rsciio/delmic/_api.py (comment resolved)
rsciio/tests/test_delmic.py (comment resolved)
rsciio/delmic/_api.py (comment resolved)
@jlaehne (Member) commented Nov 5, 2024

A single spectrum is still loaded as <Signal1D, title: , dimensions: (1, 1|2048)>, but should be <Signal1D, title: , dimensions: (|2048)> (drop the navigation dimensions).
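The requested fix can be sketched with plain NumPy: drop size-1 navigation axes before constructing the signal, so a single spectrum comes out as `(|2048)`. The function name and the assumption that navigation axes come first are hypothetical, not taken from the PR:

```python
import numpy as np

def squeeze_navigation(data, nav_ndim):
    """Drop size-1 navigation axes (assumed to be the leading axes),
    keeping the signal axes untouched."""
    nav_shape = data.shape[:nav_ndim]
    keep = tuple(i for i, n in enumerate(nav_shape) if n > 1)
    return data.reshape(
        tuple(data.shape[i] for i in keep) + data.shape[nav_ndim:]
    )

# A single spectrum loses both singleton navigation axes...
assert squeeze_navigation(np.zeros((1, 1, 2048)), nav_ndim=2).shape == (2048,)
# ...while a real map keeps its navigation shape.
assert squeeze_navigation(np.zeros((4, 5, 2048)), nav_ndim=2).shape == (4, 5, 2048)
```

The axes list passed to the signal dictionary would need the same filtering so data and axes stay consistent.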

rsciio/delmic/_api.py (comment resolved)
    the associated image type.
    """
    if Acq is None:
        raise TypeError(

Comment (Member):

Currently, all the various loading errors are giving codecov warnings, because they are not included in the tests. In hdf5, it should be rather straight forward to create some minimal test files for each case where certain elements are deleted from the file to test each of these cases. An idea could also be to use hdf5 files from some other rsciio plugins that do not match the delmic specifications to test some of the errors.
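A minimal sketch of that idea, assuming h5py is available; the group and dataset names below are illustrative, not the actual Delmic layout:

```python
import os
import tempfile

import h5py
import numpy as np

# Build a minimal HDF5 file that deliberately omits a group the reader
# expects, so the corresponding error branch can be exercised in a test.
path = os.path.join(tempfile.mkdtemp(), "broken.h5")
with h5py.File(path, "w") as f:
    acq = f.create_group("Acquisition0")  # hypothetical group name
    acq.create_dataset("Image", data=np.zeros((1, 1, 1, 4, 4)))
    # the metadata group ("PhysicalData" here) is intentionally left out

with h5py.File(path, "r") as f:
    assert "Image" in f["Acquisition0"]
    assert "PhysicalData" not in f["Acquisition0"]
# a test would then assert that the reader raises on this file, e.g.
# with pytest.raises(...): file_reader(path)
```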


# Check if Image has 5 dimensions
if Image.ndim != 5:
    raise ValueError("The input 'Image' must be a 5D h5 dataset.")

Comment (Member):
I see this check and the one above repeated in all different load functions. Are all delmic hdf5 files 5D datasets? Then it could make sense to check for this once on a higher level instead of having the same checks in every datatype function.

    or Image.shape[3] < 1
    or Image.shape[4] < 1
):
    raise ValueError(

Comment (Member):
As an example for testing errors, here you could use the file from a different type of data and change the img_type to get a case that triggers this error.

Scale = np.array(ImgData.get(scale_key))

if axis_name in ["C", "T"]:
    scale_value = np.mean(

Comment (Member):
Some other readers have a parameter to choose whether to read such axes as a UniformDataAxis by calculating the mean scale, or as a non-uniform DataAxis by just taking exactly the vector that is in the file.

see e.g.:

use_uniform_signal_axis : bool, default=False
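The suggested switch can be sketched as follows. The key names follow HyperSpy's axis dictionaries; the function name and signature are hypothetical:

```python
import numpy as np

def build_axis(axis_vector, name, units, use_uniform_signal_axis=False):
    """Return either a uniform axis dict (offset/scale from the mean step)
    or a non-uniform axis dict carrying the exact vector from the file."""
    axis_vector = np.asarray(axis_vector, dtype=float)
    if use_uniform_signal_axis:
        scale = np.mean(np.diff(axis_vector))  # average step size
        return {
            "name": name,
            "units": units,
            "size": axis_vector.size,
            "offset": axis_vector[0],
            "scale": scale,
        }
    # non-uniform DataAxis: keep the vector exactly as stored
    return {"name": name, "units": units, "axis": axis_vector}

wavelengths = [500.0, 500.9, 502.1, 503.0]  # slightly non-uniform steps
uniform = build_axis(wavelengths, "Wavelength", "nm",
                     use_uniform_signal_axis=True)
assert abs(uniform["scale"] - 1.0) < 1e-9
```

Defaulting to `use_uniform_signal_axis=False` preserves the file's exact calibration, while `True` gives the lighter uniform representation.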

metadata["Signal"]["quantity"] = "Intensity (counts)"  # Default value

if "signal_type" in Acq:
    metadata["Signal"]["signal_type"] = Acq["signal_type"]

Comment (Member):
signal_type is used to select the right signal class. If LumiSpy is installed, it should default to the correct signal classes by choosing Luminescence for anything where the only signal axis is wavelength (spectra or hyperspectral maps), LumiTransient if the only signal axis is time and LumiTransientSpectrum for streak camera images.

See e.g.

if self._lumispy_installed:
    signal["signal_type"] = "LumiTransientSpectrum"  # pragma: no cover
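The selection rule described above can be sketched as a small dispatcher. The LumiSpy signal-type strings are as named in the comment; the function name and the axis-name checks are illustrative assumptions:

```python
import importlib.util

def select_signal_type(signal_axis_names):
    """Pick a LumiSpy signal_type from the signal axes present.
    `signal_axis_names` is a list such as ["Wavelength"] or ["Time"]."""
    if importlib.util.find_spec("lumispy") is None:
        return ""  # fall back to HyperSpy's generic signal classes
    has_wavelength = "Wavelength" in signal_axis_names
    has_time = "Time" in signal_axis_names
    if has_wavelength and has_time:
        return "LumiTransientSpectrum"  # streak camera image
    if has_time:
        return "LumiTransient"
    if has_wavelength:
        return "Luminescence"
    return ""
```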

    return metadata


def make_original_metadata(Acq):

Comment (Member):
original_metadata should be 1:1 the metadata tree from the file and can be automatically parsed for hdf5 files, see e.g.

def _load_metadata(group, lazy=False, skip_array_metadata=False):

metadata should contain a curated subset of metadata with predefined key names that follows the LumiSpy/HyperSpy conventions:
https://docs.lumispy.org/en/stable/user_guide/metadata_structure.html
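The automatic 1:1 mirroring can be sketched without h5py: anything exposing `.items()` (an h5py Group, or a plain dict in this standalone demo) recurses as a sub-tree, and everything else is kept as a leaf value. Real h5py Datasets would additionally need reading and string decoding, elided here:

```python
def load_original_metadata(group):
    """Recursively mirror a group hierarchy into nested dicts
    (hypothetical sketch of an automatic original_metadata parser)."""
    out = {}
    for key, node in group.items():
        if hasattr(node, "items"):  # sub-group: recurse
            out[key] = load_original_metadata(node)
        else:                       # leaf: keep the value as-is
            out[key] = node
    return out

# Demo on a plain-dict stand-in for an HDF5 tree (names illustrative):
tree = {
    "Acquisition0": {"PhysicalData": {"Magnification": 5000}},
    "SVIData": {"Company": "Delmic"},
}
om = load_original_metadata(tree)
assert om["Acquisition0"]["PhysicalData"]["Magnification"] == 5000
```

The curated `metadata` tree would then be filled by cherry-picking known keys out of this structure and renaming them to the LumiSpy/HyperSpy conventions.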

Comment (Member):

As a note, HyperSpy's load function takes the parameter load_original_metadata that can be used to only parse the curated metadata and drop the object original_metadata in case it is very large:
https://hyperspy.org/hyperspy-doc/current/user_guide/io.html#metadata
