Use xarray-beam to append derived Replay variables #24
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
This adds two python scripts that allow one to append derived variables to the smaller 1 degree and subsampled 1/4 degree Replay datasets.
Example usage can be found in
submit_geopotential_append.sh
, and it's very similar for the static variables.Scaling up to append variables to the 1/4 degree datasets is future work, note that I couldn't even append the static variables to the 1/4 degree dataset. It seems like the steps of just opening the dataset and building the graph associated with appending to it already takes too much memory for a c2 standard 60 node.
The localzarr.py file has some modifications to the xarray_beam.zarr module necessary for appending variables. I'd like to add this to that code base if the developers are interested, but I thought this would be a good place to start.