40 add logging aggregation, LOGLEVEL env var, other logging tweaks #45

alexdunnjpl · 2023-07-26T21:35:09Z

🗒️ Summary

Tweaks logging in a handful of ways (see commit messages)

Biggest one is that product-level errors/warns are now aggregated, while maintaining explanatory details for troubleshooting.

Log level can now be set in dockerized processes via the LOGLEVEL environment variable

⚙️ Test Data and/or Report

Unit tests pass

♻️ Related Issues

fixes #40

…r reason this prevents the logs from getting blasted when batches full of similar errors present. does not aggregate when the product id appears in the reason

takes an int or string representation of a python standard log level like INFO

sonatype-lift · 2023-07-26T21:35:12Z

Sonatype Lift is retiring

Sonatype Lift will be retiring on Sep 12, 2023, with its analysis stopping on Aug 12, 2023. We understand that this news may come as a disappointment, and Sonatype is committed to helping you transition off it seamlessly. If you’d like to retain your data, please export your issues from the web console.
We are extremely grateful and thank you for your support over the years.

📖 Read about the impacts and timeline

sonatype-lift · 2023-07-26T21:38:05Z

docker/sweepers_driver.py

@@ -92,6 +92,7 @@
    logging.error(err)
    raise ValueError(f'Failed to parse username/password from PROV_CREDENTIALS value "{provCredentialsStr}": {err}')

+log_level = parse_log_level(os.environ.get('LOGLEVEL', 'INFO'))

 def run_factory(sweeper_f: Callable) -> Callable:


E302: expected 2 blank lines, found 1

ℹ️ Expand to see all @sonatype-lift commands

You can reply with the following commands. For example, reply with @sonatype-lift ignoreall to leave out all findings.

Command Usage

@sonatype-lift ignore Leave out the above finding from this PR

@sonatype-lift ignoreall Leave out all the existing findings from this PR

@sonatype-lift exclude <file|issue|path|tool> Exclude specified file|issue|path|tool from Lift findings by updating your config.toml file

Note: When talking to LiftBot, you need to refresh the page to see its response.
_{Click here to add LiftBot to another repo.}

sonatype-lift · 2023-07-26T21:38:07Z

docker/sweepers_driver.py

@@ -62,7 +62,7 @@
 from typing import Callable, Iterable

 from pds.registrysweepers import provenance, ancestry
-from pds.registrysweepers.utils import configure_logging, get_human_readable_elapsed_since
+from pds.registrysweepers.utils import configure_logging, get_human_readable_elapsed_since, parse_log_level


reportMissingImports: Import "pds.registrysweepers.utils" could not be resolved

ℹ️ Expand to see all @sonatype-lift commands

You can reply with the following commands. For example, reply with @sonatype-lift ignoreall to leave out all findings.

Command Usage

@sonatype-lift ignore Leave out the above finding from this PR

@sonatype-lift ignoreall Leave out all the existing findings from this PR

@sonatype-lift exclude <file|issue|path|tool> Exclude specified file|issue|path|tool from Lift findings by updating your config.toml file

Note: When talking to LiftBot, you need to refresh the page to see its response.
_{Click here to add LiftBot to another repo.}

tloubrieu-jpl

Some comments. Thanks @alexdunnjpl

tloubrieu-jpl · 2023-07-26T21:47:55Z

src/pds/registrysweepers/utils/__init__.py

@@ -143,14 +146,11 @@ def query_registry_db(
        total_hits = data["hits"]["total"]["value"]
        log.debug(f"   paging query ({served_hits} to {min(served_hits + page_size, total_hits)} of {total_hits})")

-        last_info_log_at_percentage = 0
-        log.info("Query progress: 0%")
-
        for hit in data["hits"]["hits"]:


there is a tqdm package which can help to track progress in a loop. I would prefer to use that rather than adding specific indicators which can make the code less readable regarding its primary focus.

tloubrieu-jpl · 2023-07-26T21:57:14Z

docker/sweepers_driver.py

@@ -92,6 +92,7 @@
    logging.error(err)
    raise ValueError(f'Failed to parse username/password from PROV_CREDENTIALS value "{provCredentialsStr}": {err}')

+log_level = parse_log_level(os.environ.get('LOGLEVEL', 'INFO'))


I remember that Sean recommended to use a log configuration file to configure the python logs, I don't know how that would integrate in a AWS deployment. We would need to mount the log configuration file in the docker image from a configuration store (I don't know what that is in AWS?). Then we would have the log level in this file.

I don't wnat to overcomplicate the changes right now, so let say it is for comment/discussion, not for change.

Can you serve config files or complex objects from the Parameter Store? Preferably whatever mechanism we use should be trivially-adjustable in AWS (as environment variables are) to allow us to switch into debug logging easily if an issue is found.

alexdunnjpl · 2023-07-26T22:05:43Z

@tloubrieu-jpl could I trouble you to open tickets for your two comments? The issues raised relate to longstanding existing code and are out-of-scope for this PR (i.e. I've left it better than I found it and y'all need these fixes yesterday).

Happy to pick up these improvements once we're out of the woods on the immediate ops issues.

EDIT

I don't wnat to overcomplicate the changes right now, so let say it is for comment/discussion, not for change.

Helps if I read the whole thing before firing off a response 😅

tloubrieu-jpl

I created a ticket for my comments (#47) to be implemented when not in a hurry.

alexdunnjpl added 7 commits July 26, 2023 14:12

implement aggregation of product update errors by error type and erro…

387cda6

…r reason this prevents the logs from getting blasted when batches full of similar errors present. does not aggregate when the product id appears in the reason

implement environment variable LOGLEVEL in sweepers_driver

9533c59

takes an int or string representation of a python standard log level like INFO

add lowercase support to utils.parse_log_level()

f6797c7

update README.md for LOGLEVEL env var

0ec9b08

fix bug in query progress logging

ea0bbdb

add explanation for Opensearch returning HTTP200 when failures exist

95cf878

remove redundant/misleading log message

ad1bea8

alexdunnjpl requested review from tloubrieu-jpl, nutjob4life and collinss-jpl as code owners July 26, 2023 21:35

sonatype-lift bot reviewed Jul 26, 2023

View reviewed changes

tloubrieu-jpl reviewed Jul 26, 2023

View reviewed changes

tloubrieu-jpl approved these changes Jul 26, 2023

View reviewed changes

alexdunnjpl changed the title ~~40 logging improvements~~ 40 add logging aggregation, LOGLEVEL env var, other logging tweaks Jul 26, 2023

alexdunnjpl merged commit 52c1529 into main Jul 26, 2023
1 check passed

alexdunnjpl deleted the 40-logging-improvements branch July 26, 2023 22:55

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

40 add logging aggregation, LOGLEVEL env var, other logging tweaks #45

40 add logging aggregation, LOGLEVEL env var, other logging tweaks #45

alexdunnjpl commented Jul 26, 2023

sonatype-lift bot commented Jul 26, 2023

sonatype-lift bot Jul 26, 2023

sonatype-lift bot Jul 26, 2023

tloubrieu-jpl left a comment

tloubrieu-jpl Jul 26, 2023

tloubrieu-jpl Jul 26, 2023

alexdunnjpl Jul 26, 2023

alexdunnjpl commented Jul 26, 2023 •

edited

Loading

tloubrieu-jpl left a comment

Command	Usage
`@sonatype-lift ignore`	Leave out the above finding from this PR
`@sonatype-lift ignoreall`	Leave out all the existing findings from this PR
`@sonatype-lift exclude <file\|issue\|path\|tool>`	Exclude specified `file\|issue\|path\|tool` from Lift findings by updating your config.toml file

40 add logging aggregation, LOGLEVEL env var, other logging tweaks #45

40 add logging aggregation, LOGLEVEL env var, other logging tweaks #45

Conversation

alexdunnjpl commented Jul 26, 2023

🗒️ Summary

⚙️ Test Data and/or Report

♻️ Related Issues

sonatype-lift bot commented Jul 26, 2023

Sonatype Lift is retiring

sonatype-lift bot Jul 26, 2023

Choose a reason for hiding this comment

sonatype-lift bot Jul 26, 2023

Choose a reason for hiding this comment

tloubrieu-jpl left a comment

Choose a reason for hiding this comment

tloubrieu-jpl Jul 26, 2023

Choose a reason for hiding this comment

tloubrieu-jpl Jul 26, 2023

Choose a reason for hiding this comment

alexdunnjpl Jul 26, 2023

Choose a reason for hiding this comment

alexdunnjpl commented Jul 26, 2023 • edited Loading

tloubrieu-jpl left a comment

Choose a reason for hiding this comment

alexdunnjpl commented Jul 26, 2023 •

edited

Loading