`OpenEphysBinaryRawIO`: Separate Neural and Non-Neural Data into Distinct Streams #1624

h-mayorquin · 2025-01-16T18:51:14Z

The continuous.dat file in the Open Ephys Binary format often contains both neural and non-neural (ADC) channels. For detailed documentation, refer to the Open Ephys GUI User Manual.

An example dataset (provided by @rly) illustrating this structure is available here. Additionally, a related discussion can be found in the NeuroConv repository: Issue #1023. I think this is the module that the test set used

This pull request modifies the reader to separate non-neural channels into their own stream. This approach allows users to clearly distinguish between neural and non-neural data and will enable programmatic separation in downstream libraries like SpikeInterface and Neuroconv.

Implementation Details:

To maintain backward-compatibility of stream_ids, the non-neural data's stream_id is defined as the neural data's stream_id plus the total number of available streams. For example, if the neural data has a stream_id of "1" and there are three streams in total, the non-neural data will be assigned a stream_id of "4".

The stream_name for non-neural data is derived by appending _ADC to the neural data's stream_name. For instance, if the neural data's stream_name is Data, the non-neural data's stream_name will be DATA_ADC.

I welcome better suggestions for conventions of naming and any other type of feedback from people who are more familiar with this format @alejoe91

h-mayorquin · 2025-01-16T19:40:07Z

Testing data is here:

https://gin.g-node.org/NeuralEnsemble/ephy_testing_data/pulls/150

Once the data is merged I can add a test

alejoe91 · 2025-01-17T08:01:22Z

@h-mayorquin can you fix Ryan's last name here? https://gin.g-node.org/NeuralEnsemble/ephy_testing_data/pulls/150#issuecomment-5561

Se we can merge and add a test here

neo/rawio/openephysbinaryrawio.py

samuelgarcia · 2025-01-17T11:11:44Z

neo/rawio/openephysbinaryrawio.py

+        if stream_index >= len(self._sig_streams[block_index][seg_index]):
+            stream_index = stream_index - len(self._sig_streams[block_index][seg_index])    
        t_start = self._sig_streams[block_index][seg_index][stream_index]["t_start"]


I understand correctly the ptach stream_index is not anymore a stream_index but an internal variable which is not a direct mapping to stream_index.
self._sig_streams[block_index][seg_index][stream_index]["t_start"]

If this is the case I would change this internal variable to something else to avoid mistale when reading the entire code.

Maybe I am mistunderstanding or not clear at all but this writting is a bit weird to me.

You are correct. This is a good idea.

Co-authored-by: Alessio Buccino <[email protected]>

h-mayorquin · 2025-01-17T12:30:15Z

I corrected Ryan's last name (my bad) on the test data repo.

Question: why do we have stream_id of the synch channel as the empty string? It is loaded with the neural data of the continuous event with a flag, shouldn't they belong to the same stream?

alejoe91 · 2025-01-17T12:32:34Z

Gin is merged!

alejoe91 · 2025-01-17T13:48:03Z

@h-mayorquin so just missing the test with the new test file and this is ready for final review, right?

h-mayorquin · 2025-01-17T13:59:12Z

This is ready for final review now.

h-mayorquin · 2025-01-17T14:07:36Z

neo/rawio/openephysbinaryrawio.py

+                        stream_id_neural = stream_id
+                        stream_id_non_neural = str(int(stream_id) + self._num_of_signal_streams)
+
+                        # Note this implementation assumes that the neural channels come before the non-neural channels


Here guys, these are the strongest assumptions that the reader makes. I wanted to check with @alejoe91 if he knows about this:

Is it true that the synch trace will always be the last channel even if you have non-neural channels? I could not find this in the documentation.

Are the neural channels and non-neural always in order and not interleaved?

If the first case is not True, we can extract the specific index from the structure.oebin. That's a bigger change but I think doable.

For the second case, if it is not true, I am afraid we will have to error somehow as the current data model does not support non-contigious streams across the buffer with the buffer slice mechanism (@samuelgarcia )

I am adding error messages for this advising people to open an issue if they have interleaved neural and non-neural or synch trace that is not at the end.

alejoe91 · 2025-01-17T14:45:09Z

@h-mayorquin some tests are failing. This is because the array annotations have changed now, since also they need to be split across streams

h-mayorquin · 2025-01-17T14:53:36Z

@h-mayorquin some tests are failing. This is because the array annotations have changed now, since also they need to be split across streams

I fixed them now. The array annotations were not divided for the stream.

h-mayorquin · 2025-01-17T15:17:04Z

@alejoe91 tests are passing but the docs are failing for some reason (they have failed intermittently through this PR) not sure if related to this. Updating the branch to check.

zm711

Just a few comments. I know Alessio and Sam provide better feedback for openephys so take what you like and ignore what you don't :)

zm711 · 2025-01-24T21:03:12Z

neo/rawio/openephysbinaryrawio.py

+                    # For ADC channels multiplying by the bit_volts when units are not provided converts to Volts
+                    units = "V" if units == "" else units
+
+                gain = chan_info["bit_volts"]


is this a safe float? or do we need to convert it?

What do you mean safe? as in numpy?

Yes. I mean is it a numpy scalar (I should have written that better :) ) or is it a python float?

For floats, there is no difference I think right? We only have to be concerned about integers. I think np.float is the same thing as float in python (from my memory).

I changed it anyway I guess null casting should have no cost.

So I looked it up mostly from stackoverflow discussion
https://stackoverflow.com/questions/40726490/overflow-error-in-pythons-numpy-exp-function
and reddit
https://www.reddit.com/r/learnpython/comments/feakn2/numpy_overflow_help_needed/?rdt=48979
And seems like it can overflow, but honestly the issue of float overflow seems to be more benign and casting to python seems to cost precision that honestly shouldn't really matter right? But float precision is so OS dependent that I think requiring that level of precision can't really be expected from floats. Appreciate the casting though, but in general your intuition/memory of this being an int thing seems to be true.

neo/rawio/openephysbinaryrawio.py

zm711 · 2025-01-24T21:08:23Z

neo/rawio/openephysbinaryrawio.py

        for stream_index, stream_name in enumerate(sig_stream_names):
-            # stream_index is the index in vector sytream names
+            # stream_index is the index in vector stream names


zm711 · 2025-01-24T21:09:45Z

neo/rawio/openephysbinaryrawio.py

+                    # We defined their stream_id as the stream_index of neural data plus the number of neural streams
+                    # This is to not break backwards compatbility with the stream_id numbering
+                    stream_id = str(stream_index + len(sig_stream_names))
+                    # For ADC channels multiplying by the bit_volts when units are not provided converts to Volts


What do you mean here?

the way it is written sounds like if units are provided then we SHOULD NOT worry about using the gain,? So is the gain 1 in this case and our code is fine?

Yes, that's not very clear without context. More context: When the units are not provided the units of ADC are Volts. This is in opposition to the neural channels where if the units are not provided the units are microvolts.

https://open-ephys.github.io/gui-docs/User-Manual/Recording-data/Binary-format.html#continuous

But the units are always the units of the data AFTER you use the gain. Only the unitless(units=="") case differs between neural and non-neural channels. Microvolts for the former, volts for the latter.

Okay, yeah that makes sense in the context (and is the same with intan so ADC being volts and neural being microvolts seems common enough.

Re-wrote the whole thing to separate unit determination from stream determination and I think it reads better now.

Yeah I agree it makes it clearer to me at least!

neo/rawio/openephysbinaryrawio.py

fix to add ADC channels

7a51577

alejoe91 self-requested a review January 16, 2025 19:35

alejoe91 self-assigned this Jan 17, 2025

alejoe91 reviewed Jan 17, 2025

View reviewed changes

neo/rawio/openephysbinaryrawio.py Outdated Show resolved Hide resolved

samuelgarcia reviewed Jan 17, 2025

View reviewed changes

h-mayorquin and others added 2 commits January 17, 2025 06:18

Update neo/rawio/openephysbinaryrawio.py

9b41dd2

Co-authored-by: Alessio Buccino <[email protected]>

sam suggestion

41d17eb

h-mayorquin added 2 commits January 17, 2025 07:57

add test and fix buffer slice

009861f

do list comparison

7474f42

h-mayorquin marked this pull request as ready for review January 17, 2025 13:59

fix comparison

b5b63d8

h-mayorquin commented Jan 17, 2025

View reviewed changes

add errors

84efcb8

fix annotations

ebebd29

h-mayorquin added 5 commits January 17, 2025 09:17

Merge branch 'master' into fix_openephys_stream

34b6616

trigger tests

81f85c3

Merge branch 'master' into fix_openephys_stream

e7c3244

Merge branch 'master' into fix_openephys_stream

e11e12c

Merge branch 'master' into fix_openephys_stream

19b76e3

zm711 reviewed Jan 24, 2025

View reviewed changes

zach feedback

e92de38

Merge branch 'master' into fix_openephys_stream

faba17d

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

`OpenEphysBinaryRawIO`: Separate Neural and Non-Neural Data into Distinct Streams #1624

`OpenEphysBinaryRawIO`: Separate Neural and Non-Neural Data into Distinct Streams #1624

h-mayorquin commented Jan 16, 2025 •

edited

Loading

h-mayorquin commented Jan 16, 2025

alejoe91 commented Jan 17, 2025

samuelgarcia Jan 17, 2025

h-mayorquin Jan 17, 2025

h-mayorquin Jan 17, 2025

h-mayorquin commented Jan 17, 2025

alejoe91 commented Jan 17, 2025

alejoe91 commented Jan 17, 2025

h-mayorquin commented Jan 17, 2025

h-mayorquin Jan 17, 2025

h-mayorquin Jan 17, 2025

alejoe91 commented Jan 17, 2025

h-mayorquin commented Jan 17, 2025

h-mayorquin commented Jan 17, 2025

zm711 left a comment

zm711 Jan 24, 2025

h-mayorquin Jan 27, 2025

zm711 Jan 27, 2025

h-mayorquin Jan 27, 2025 •

edited

Loading

h-mayorquin Jan 27, 2025

zm711 Jan 27, 2025

zm711 Jan 24, 2025

zm711 Jan 24, 2025

h-mayorquin Jan 27, 2025

zm711 Jan 27, 2025

h-mayorquin Jan 27, 2025

zm711 Jan 27, 2025

OpenEphysBinaryRawIO: Separate Neural and Non-Neural Data into Distinct Streams #1624

Are you sure you want to change the base?

OpenEphysBinaryRawIO: Separate Neural and Non-Neural Data into Distinct Streams #1624

Conversation

h-mayorquin commented Jan 16, 2025 • edited Loading

h-mayorquin commented Jan 16, 2025

alejoe91 commented Jan 17, 2025

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

h-mayorquin commented Jan 17, 2025

alejoe91 commented Jan 17, 2025

alejoe91 commented Jan 17, 2025

h-mayorquin commented Jan 17, 2025

Choose a reason for hiding this comment

Choose a reason for hiding this comment

alejoe91 commented Jan 17, 2025

h-mayorquin commented Jan 17, 2025

h-mayorquin commented Jan 17, 2025

zm711 left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

h-mayorquin Jan 27, 2025 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

`OpenEphysBinaryRawIO`: Separate Neural and Non-Neural Data into Distinct Streams #1624

`OpenEphysBinaryRawIO`: Separate Neural and Non-Neural Data into Distinct Streams #1624

h-mayorquin commented Jan 16, 2025 •

edited

Loading

h-mayorquin Jan 27, 2025 •

edited

Loading