Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Reprocess and annotate OpenNeuro with SNOMED assessment terms #213

Open
11 tasks
surchs opened this issue Jan 6, 2025 · 0 comments
Open
11 tasks

Reprocess and annotate OpenNeuro with SNOMED assessment terms #213

surchs opened this issue Jan 6, 2025 · 0 comments
Labels
Milestone Used to track other issues that are required to complete the milestone.

Comments

@surchs
Copy link
Contributor

surchs commented Jan 6, 2025

Context

We now support derivatives. OpenNeuro has derivatives (https://github.com/OpenNeuroDerivatives/OpenNeuroDerivatives), so we should add them. We are also changing the vocabulary for assessment tools to SNOMED in #202 and we will need to reprocess the existing OpenNeuro annotations to make sure we can still discover them

Why

  • to keep OpenNeuro annotations in sync with our developing data models
  • to make sure users can come to query.neurobagel.org to search for derivatives (not possible elsewhere atm)
  • to test out in practice our new process for automatically processing OpenNeuro datasets
  • so we are ready for users to add these annotations and then have them be reflected in our query quickly

Outcomes

  • All currently annotated OpenNeuro datasets have a data dictionary that uses SNOMED terms for assessment tools (where applicable)
  • There is an automatic process that generates new .jsonld files for each OpenNeuro dataset for which we update the data dictionary
  • This automatic process also includes the derivatives (not sure if automatically extracted or semi-manually)
  • The new .jsonld files are reuploaded to our OpenNeuro graph

potential issue list (temporary)

These issues should be moved to sub-issues

  • review current process for updating the OpenNeuro node (how do we re-generate all the .jsonld)
  • make sure all workflows on OpenNeuro-JSONLD still run (especially the ones merging upstream and creating new repos)
  • Make a list of all used cogatlas terms in OpenNeuro (from graph or from dictionaries) and see how many of them have an equivalent in SNOMED
  • create a process to index and keep up to date the list of ON datasets with derivatives. also figure out what kinds of pipelines (and versions) they have been run with (might require email)
  • update the annotations of all OpenNeuro datasets we currently have annotated
  • re-generate the graph files for the OpenNeuro node
  • Redeploy the OpenNeuro node with the up to date tools
@surchs surchs added the Milestone Used to track other issues that are required to complete the milestone. label Jan 6, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Milestone Used to track other issues that are required to complete the milestone.
Projects
Status: No status
Development

No branches or pull requests

1 participant