Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add ADPSFC prepbufr scripts to SPOC #7

Open
2 of 4 tasks
nicholasesposito opened this issue Oct 30, 2024 · 1 comment
Open
2 of 4 tasks

Add ADPSFC prepbufr scripts to SPOC #7

nicholasesposito opened this issue Oct 30, 2024 · 1 comment
Assignees

Comments

@nicholasesposito
Copy link

nicholasesposito commented Oct 30, 2024

Add ADPSFC prepbufr scripts to the SPOC repository.

These are the high-level scripts.

  • Build the backend
  • Write scripts
  • Compare to GSI diag and GSI IODA
  • Compare to low level outputs, generated by global workflow using gdasprepatmiodaobs

All of the comparisons are done here:
https://docs.google.com/presentation/d/1CQCtbzntg2hXruLKEqqEVuKsMizE8h2DjPmAcka29eg/edit?pli=1#slide=id.g2fc02b94591_0_31

Notes:

  • These high level scripts obtain data that is obstypes 181, 183, 187, 281, 284, 287. Obstypes 183 and 284 are QC'd out but are still present in the data so we keep them. However, they are not plotted in the plots of the comparisons.

  • There are slightly fewer obs in the high level backend outputs compared to GSI IODA and the low level API output. This is because the scripts I wrote are inclusively bounded in the beginning but not the end (-3 hours from cycle time to +2.999999 hours from cycle time). GSI and the low level api are inclusive at -3 hours and + 3 hours, though this may be a bug in the GSI as it is not supposed to include up to +3 and should end at +2.99999. EDIT: This was indeed a bug. I made changes to my GSI so if the time was +3 hours, it was cycled out.

  • The three MetaData variables are height (ZOB mnemonic), stationElevation (ELV mnemonic) and pressure.
    stationPressure is the one variable that is simulated.
    stationPressure and pressure are the Same. It is called pressure in MetaData and stationPressure in ObsValue to be consistent among other data types.

  • For height, GSI has many more datapoints than the backend (almost double) because of the way GSI records data. There is a difference in the mean of 4m and in standard deviation of 1.2m

  • For stationElevation, backend has more 2200 more datapoints (out of 143700) but the mean difference is .7 m and the standard deviation difference is 0.9 m.

  • I’m attributing the differences in height and stationElevation to ELV being on the main mnemonic branch while ZOB is a replication.
    pressure and stationPressure are the same. The GSI outputs (diag and nc) have less datapoints compared to backend(~72500 compared to ~143800). The difference in means is only 30 Pa (0.3 mb) and for standard deviation 300 Pa (3mb).

  • sequenceNumber added to MetaData as an array of 0s.

@nicholasesposito nicholasesposito self-assigned this Oct 30, 2024
@nicholasesposito nicholasesposito changed the title Add ADPSFC to SPOC Add ADPSFC prepbufr scripts to SPOC Oct 30, 2024
@nicholasesposito
Copy link
Author

Added dateTime and cycle_time to py. dateTime is computed in mapping for backend, unless script is used. If script, re-computes. cycle_time comes from yaml and is used to compute dateTime. minDateTime calculated. All scripts edited to be like Emily's suggestions, input names and output names finalized, attributes good. Now using original prepbufr file (gdas.t??z.prepbufr) and using subsets from yaml to determine which data to grab, rather than splitting into a separate file. Deleted yamls that use 'mpi0' since we don't want it in output filename if mpi is not used.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant