As a user, I want to sync ESA PSA products from the Search API #135

jordanpadams · 2024-09-28T18:45:39Z

Checked for duplicates

Yes - I've already checked

🧑‍🔬 User Persona(s)

Data User

💪 Motivation

...so that I can have the ESA PSA products available through the Solr search

📖 Additional Details

I think the easiest route is a separate Python script that downloads the XML to a local archive, followed by a harvest run to load it into the legacy registry.

Acceptance Criteria

Given a Registry Search API loaded with ESA PSA context products
When I perform pds-sync-api --node-name psa --download-path /path/to/download/XML
Then I expect a Python script to download the XML files to --download-path (skipping any that already exist there)
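
The command line in the criteria above could be parsed along these lines. This is a minimal sketch, assuming a plain argparse entry point: only the pds-sync-api name and the --node-name and --download-path options come from this ticket, while build_parser, main, and the help text are hypothetical.

```python
import argparse
from pathlib import Path


def build_parser() -> argparse.ArgumentParser:
    """Build a parser for the two options named in the acceptance criteria."""
    parser = argparse.ArgumentParser(
        prog="pds-sync-api",
        description="Download label XML for a node from the Search API (sketch).",
    )
    # Node whose products should be synced, e.g. "psa" (help text is an assumption).
    parser.add_argument("--node-name", required=True, help="node to sync, e.g. psa")
    # Local directory that receives the downloaded XML files.
    parser.add_argument(
        "--download-path",
        required=True,
        type=Path,
        help="directory to download XML files into",
    )
    return parser


def main(argv=None) -> argparse.Namespace:
    """Parse argv (defaults to sys.argv[1:]) and return the parsed options."""
    return build_parser().parse_args(argv)
```

For example, main(["--node-name", "psa", "--download-path", "/path/to/download/XML"]) would return a namespace with node_name and download_path set; the real script would go on to run the sync from there.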

⚙️ Engineering Details

- query the Search API for all PSA context products, bundles, and collections
- paginate through the results and, for each product:
  - check whether the LIDVID has already been loaded into the Registry
  - if not, check whether the XML is already in --download-path (using the file name and ops:Label_File_Info.ops:md5_checksum)
  - if the file does not exist, download it to --download-path
- execute harvest on those XML files
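
The steps above could be sketched roughly as follows. This is not the script that was eventually written, only an illustration under stated assumptions: the Search API call, the Registry lookup, and the HTTP download are passed in as callables (fetch_page, is_registered, download), the start/limit paging scheme is assumed, and the product keys lidvid, file_name, md5, and url are hypothetical stand-ins for fields such as ops:Label_File_Info.ops:md5_checksum. Running harvest is left to the separate script mentioned in this ticket.

```python
import hashlib
from pathlib import Path
from typing import Callable, Iterable, Iterator

# Hypothetical: a page fetcher takes (start, limit) and returns a list of products.
PageFetcher = Callable[[int, int], list]


def iter_products(fetch_page: PageFetcher, limit: int = 50) -> Iterator[dict]:
    """Yield products page by page until a short (or empty) page is returned."""
    start = 0
    while True:
        page = fetch_page(start, limit)
        yield from page
        if len(page) < limit:
            return
        start += limit


def md5_of(path: Path) -> str:
    """Return the hex MD5 digest of a file, read in chunks."""
    digest = hashlib.md5()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(65536), b""):
            digest.update(chunk)
    return digest.hexdigest()


def needs_download(target: Path, expected_md5: str) -> bool:
    """True if the file is missing or its checksum differs from the expected one."""
    return not target.is_file() or md5_of(target) != expected_md5.lower()


def sync(
    products: Iterable[dict],
    download_path: Path,
    is_registered: Callable[[str], bool],
    download: Callable[[str, Path], None],
) -> list:
    """Download labels not yet registered and not already present locally.

    Returns the paths that were downloaded, so a later step can harvest them.
    """
    fetched = []
    for product in products:
        if is_registered(product["lidvid"]):
            continue  # already loaded into the Registry
        target = download_path / product["file_name"]
        if needs_download(target, product["md5"]):
            download(product["url"], target)
            fetched.append(target)
    return fetched
```

Injecting the three callables keeps the skip logic testable without network access, and the returned list gives the follow-up harvest script an explicit set of new files to load.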

This will be a two-part ticket since we will then need a bash script to be added to this repo to actually execute harvest on the downloaded data.

🎉 I&T

No response

jordanpadams · 2024-09-28T18:59:39Z

@nutjob4life not sure of the best place to put this script. It can either go somewhere here (with a requirements.txt), in our operations repo (which contains a bunch of ad-hoc scripts), or ?

tloubrieu-jpl · 2024-10-10T21:10:44Z

45.66666% done

Sync ESA-PSA label files to a local directory

jordanpadams added the needs:triage and requirement labels Sep 28, 2024

jordanpadams self-assigned this Sep 28, 2024

jordanpadams added this to EN Portfolio Backlog Sep 28, 2024

github-project-automation bot moved this to ToDo in EN Portfolio Backlog Sep 28, 2024

jordanpadams assigned nutjob4life and unassigned jordanpadams Sep 28, 2024

jordanpadams added B15.1 sprint-backlog and removed needs:triage labels Sep 28, 2024

jordanpadams added this to B15.1 Sep 28, 2024

github-project-automation bot moved this to Release Backlog in B15.1 Sep 28, 2024

jordanpadams added the p.must-have label Sep 28, 2024

jordanpadams assigned jordanpadams and nutjob4life and unassigned nutjob4life and jordanpadams Sep 28, 2024

nutjob4life mentioned this issue Oct 11, 2024

Sync ESA-PSA label files to a local directory NASA-PDS/operations#557

Merged

nutjob4life mentioned this issue Oct 21, 2024

As a user, I want to search for ESA data sets and context products NASA-PDS/portal-wp#31

Open

jordanpadams added a commit to NASA-PDS/operations that referenced this issue Oct 31, 2024

Merge pull request #557 from NASA-PDS/registry-legacy-solr#135

7e14add
