Add ability to specify generated time series file output path #117

nusbaume · 2024-07-16T22:06:17Z

Fixes #116

All Submissions:

Have you followed the guidelines in our Contributer's Guide](https://github.com/NCAR/CUPiD/wiki/Contributor's-Guide) (including the pre-commit check)?
Have you checked to ensure there aren't other open Pull Requests for the same update/change?

New Feature Submissions:

Does your submission pass tests?
Have you lint your code locally prior to submission?

Changes to Core Features:

Have you added an explanation of what your changes do and why you'd like us to include them?
Have you written new tests for your core changes, as applicable?
Have you successfully ran tests with your changes locally?

for user to control time series generator output location."

…erator.

nusbaume · 2024-07-16T22:18:57Z

@mnlevy1981 @TeaganKing please let me know what tests you want to have run with this branch as specified in the checklist above (I obviously tested it in my own config file). Thanks!

TeaganKing · 2024-07-17T17:15:04Z

Hi @nusbaume , We don't have robust testing set up yet (that PR template checklist was a bit preemptive), so I think just testing with your own config file is great.

TeaganKing · 2024-07-17T17:17:08Z

It looks like the pre-commit is also passing. Whenever it's ready, feel free to request a review.

nusbaume · 2024-07-17T20:33:09Z

@TeaganKing thanks for letting me know! Sadly it looks like I am unable to add reviewers to this PR.

mnlevy1981

This looks great! Just one small request and then a quick question that will lead to a second small request regardless of your answer :)

mnlevy1981 · 2024-07-17T20:46:03Z

cupid/run.py

+                ts_output_dir = [
+                    os.path.join(
+                        global_params["CESM_output_dir"],
+                        timeseries_params["case_name"],
+                        f"{component}", "proc", "tseries",
+                    ),
+                ]
+
+                if "ts_output_dir" in timeseries_params:
+                    ts_output_dir = [
+                        os.path.join(
+                            timeseries_params["ts_output_dir"],
+                            f"{component}", "proc", "tseries",
+                        ),
+                    ]


One suggestion: can we keep ts_output_dir as a string, and pass [ts_output_dir] to cupid.timeseries.create_time_series()? (A separate issue ticket might be to update create_time_series to accept strings or lists as arguments, so list-ifying a handful of arguments seems silly... but that will be easier to address if ts_output_dir is a string.)

Also, a question: an alternate approach to this block of code would be

if "ts_output_dir" in timeseries_params: ts_output_dir =os.path.join( timeseries_params["ts_output_dir"], f"{component}", "proc", "tseries", ) else: ts_output_dir = os.path.join( global_params["CESM_output_dir"], timeseries_params["case_name"], f"{component}", "proc", "tseries", )

Is there a reason to prefer one implementation over the other? If you want to keep

var=value if condition: var=other_value

I see one if-else block in run.py, could you change that one?

# Doing initial subsetting on full catalog, e.g. to only use certain cases + cat_path = full_cat_path if "subset" in control["data_sources"]: first_subset_kwargs = control["data_sources"]["subset"] cat_subset = full_cat.search(**first_subset_kwargs) # This pulls out the name of the catalog from the path cat_subset_name = full_cat_path.split("/")[-1].split(".")[0] + "_subset" cat_subset.serialize( directory=temp_data_path, name=cat_subset_name, catalog_type="file", ) cat_path = temp_data_path + "/" + cat_subset_name + ".json" - else: - cat_path = full_cat_path

I am happy with either method of writing the if-statement. I personally prefer the if-else method, but have found that some static analysis tools like pylint will complain about it, hence I went the var=value, if condition: var=other_value way in this PR. Do you all have a preference one way or the other?

Let's stick with if-else, and if we eventually add pylint or some other checker that complains we'll just have one more block to update.

Sounds good! I've applied the requested changes.

mnlevy1981

looks great!

nusbaume added 3 commits July 16, 2024 15:12

Add new optional 'ts_output_dir' config variable to allow

ce84290

for user to control time series generator output location."

Apply changes found by pre-commit.

595e4d6

Apply os.path.join method to remaining directories in time series gen…

42bd1db

…erator.

mnlevy1981 requested changes Jul 17, 2024

View reviewed changes

Apply review requests.

91c1698

nusbaume requested a review from mnlevy1981 July 17, 2024 22:40

mnlevy1981 approved these changes Jul 18, 2024

View reviewed changes

mnlevy1981 merged commit 644ca18 into NCAR:main Jul 18, 2024
1 check passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add ability to specify generated time series file output path #117

Add ability to specify generated time series file output path #117

nusbaume commented Jul 16, 2024 •

edited

Loading

nusbaume commented Jul 16, 2024

TeaganKing commented Jul 17, 2024

TeaganKing commented Jul 17, 2024

nusbaume commented Jul 17, 2024

mnlevy1981 left a comment

mnlevy1981 Jul 17, 2024

nusbaume Jul 17, 2024

mnlevy1981 Jul 17, 2024

nusbaume Jul 17, 2024

mnlevy1981 left a comment

Add ability to specify generated time series file output path #117

Add ability to specify generated time series file output path #117

Conversation

nusbaume commented Jul 16, 2024 • edited Loading

All Submissions:

New Feature Submissions:

Changes to Core Features:

nusbaume commented Jul 16, 2024

TeaganKing commented Jul 17, 2024

TeaganKing commented Jul 17, 2024

nusbaume commented Jul 17, 2024

mnlevy1981 left a comment

Choose a reason for hiding this comment

mnlevy1981 Jul 17, 2024

Choose a reason for hiding this comment

nusbaume Jul 17, 2024

Choose a reason for hiding this comment

mnlevy1981 Jul 17, 2024

Choose a reason for hiding this comment

nusbaume Jul 17, 2024

Choose a reason for hiding this comment

mnlevy1981 left a comment

Choose a reason for hiding this comment

nusbaume commented Jul 16, 2024 •

edited

Loading