
Add logging to API #2051

Draft · wants to merge 18 commits into master

Conversation

@anth-volk (Collaborator) commented Dec 9, 2024

Fixes #1947.

This PR adds logging to both the API and its attached workers, plus tests for the base Logger class (but not yet WorkerLogger). I'm opening it as a draft because there are a few drawbacks (listed below) that I'm stuck on, and I'd welcome any assistance from the requested reviewers. This is very much a work in progress: as I've delved into some of the complexities of logging, I've realized there are a lot of considerations this code may not be handling, even at the design level. Any and all feedback is welcome.

Drawbacks:

  • When running Flask in debug mode, Flask starts a server process, which triggers the start of the logger. Flask then uses that process to spawn a child process that also runs the API, but with Werkzeug's debugging tools, creating yet another log file; it is this child process that ultimately serves requests. I've tried checking for particular environment variables and only launching the logger when the right combination is present, but this feels hackish and has negative ramifications for the worker. I've also tried Flask's before_request hook, but that isn't a solution either.
  • I was unable to properly launch memory usage logging. My preferred implementation merely copied what I had done previously in Add logging to worker #1948, but it either crashed (when storing the logger via a weak reference) or improperly logged memory stats for all workers (when using a strong one). I'm unsure how to proceed on this.
  • The reform impact service typically runs only alongside a job, which itself runs on a worker. However, the service interfaces with portions of the code that are not isolated to the job itself, which makes it hard to choose a log file to write to. When logging with the more basic Logger class, the service actually creates a new log file, perhaps because it runs within the worker thread rather than the main API one. When logging with the WorkerLogger class, the service has no access to the correct worker ID (at least none that I found), so it creates a file with the wrong name and logs to a separate file from the worker itself.
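For reference, the environment-variable check mentioned in the first drawback usually keys off WERKZEUG_RUN_MAIN, which Werkzeug's reloader sets to "true" in the child process that actually serves requests. A minimal sketch (the function name is hypothetical, and as noted above this approach interacts badly with the worker):

```python
import os


def should_init_logger() -> bool:
    """Return True only in the process that will serve requests.

    When Flask's debug reloader is active, the parent process spawns a
    child with WERKZEUG_RUN_MAIN=true; only the child should start the
    logger. Outside debug mode neither variable applies, so default to True.
    """
    if os.environ.get("FLASK_DEBUG") == "1":
        return os.environ.get("WERKZEUG_RUN_MAIN") == "true"
    return True
```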

@anth-volk anth-volk marked this pull request as draft December 9, 2024 19:12
@anth-volk anth-volk requested a review from mikesmit December 9, 2024 19:12
@mikesmit (Collaborator) left a comment

Pausing here because I think the main comment about "why aren't we doing this the usual log4x way" is the most relevant and impacts the rest of the review.

import os


class Logger:

discussion, blocking: With that qualifier, I'm looking at this as a fairly standard Log4X-style logger, though there may be aspects/conventions in Python that make this different...

Generally I would expect you to just use logging.getLogger in your classes like this,

and configure your logs globally at the root logger level rather than on child loggers, generally with something like logging.config.fileConfig or command-line arguments.

That way you could have different settings for the main app thread vs. the worker thread, and tune specific loggers.

suggestion: I think this is worth a discussion. You may have gone this way for specific reasons I'm not aware of.
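To illustrate the pattern being suggested, here is a sketch of root-level configuration via logging.config.dictConfig, with per-logger tuning; the logger name "policyengine_api.worker" is a hypothetical example, not a name from this codebase:

```python
import logging
import logging.config

# Hypothetical root-level configuration, applied once at app startup.
LOGGING_CONFIG = {
    "version": 1,
    "disable_existing_loggers": False,
    "formatters": {
        "default": {"format": "%(asctime)s %(name)s %(levelname)s %(message)s"},
    },
    "handlers": {
        "console": {"class": "logging.StreamHandler", "formatter": "default"},
    },
    "root": {"level": "INFO", "handlers": ["console"]},
    # Per-logger tuning, e.g. a more verbose worker logger:
    "loggers": {
        "policyengine_api.worker": {"level": "DEBUG"},
    },
}

logging.config.dictConfig(LOGGING_CONFIG)

# Then, in each module, no wrapper class is needed:
logger = logging.getLogger(__name__)
logger.info("configured via the root logger")
```

Modules never touch handlers themselves; they only call logging.getLogger(__name__) and inherit whatever the root config says.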


# Format message with context if provided
if context:
context_str = " ".join(f"{k}={v}" for k, v in context.items())

This makes more sense to me in terms of standardizing your message string. You don't necessarily need to wrap the logger to do this, though; you could just write a format_context function or similar:

logger.info(format_context(message, context))

self.cloud_client: cloud_logging.Client = None

# Prevent duplicate handlers
if not self.logger.handlers:

I believe if we configure as discussed above you don't have to worry about this.

self.logger.addHandler(console_handler)

# Google Cloud Logging handler; don't log to GCP if in debug
if log_to_cloud and os.environ.get("FLASK_DEBUG") != "1":

Similarly, you could configure this at the application level rather than in code at each logger declaration. Maybe there are reasons to cloud-log only some things, but I'd generally expect it to be on or off for the whole app.
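A sketch of the app-level toggle, assuming the google-cloud-logging package in production (its Client.setup_logging() attaches a handler to the root logger); the function name configure_cloud_logging is hypothetical:

```python
import os


def configure_cloud_logging() -> bool:
    """Attach Google Cloud Logging to the root logger once, at app startup.

    Returns True if cloud logging was enabled. Skips GCP entirely in debug
    mode, so individual loggers never need their own FLASK_DEBUG check.
    """
    if os.environ.get("FLASK_DEBUG") == "1":
        return False
    import google.cloud.logging  # deferred: only needed in production

    client = google.cloud.logging.Client()
    client.setup_logging()  # routes root-logger records to Cloud Logging
    return True
```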

Comment on lines +27 to +29
monitor_memory (bool): Whether to monitor memory usage (defaults to True)
memory_threshold (int): Memory usage threshold to trigger warnings (default: 75%)
memory_check_interval (int): How often to check memory in seconds (default: 5)

nit, documentation: these arguments are documented but not present in the constructor. suggestion: remove them.

Comment on lines +37 to +43
self.logger = logging.getLogger(self.name)
self.logger.setLevel(logging.INFO)

# Create log directory if it doesn't exist
try:
self.dir.mkdir(parents=True, exist_ok=True)
except Exception as e:

question, blocking: who/what is this for? Google App Engine looks at either stdout/stderr or the Cloud Logging API.

For that reason I'm surprised we write this by default, even in the cloud.

suggestion: I would not do this by default; I would only do it in dev mode. In the context of the suggestion above, I would pass a different config in dev than in prod.
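One way to sketch that dev-vs-prod split with dictConfig: the file handler exists only in the dev config, while prod writes to stdout/stderr for App Engine to collect. The config dicts and the logs/api.log path are hypothetical examples:

```python
import logging.config
import os

# Dev: console plus a local log file. "delay": True postpones opening
# the file until the first record is written.
DEV_CONFIG = {
    "version": 1,
    "handlers": {
        "console": {"class": "logging.StreamHandler"},
        "file": {
            "class": "logging.FileHandler",
            "filename": "logs/api.log",  # hypothetical path
            "delay": True,
        },
    },
    "root": {"level": "DEBUG", "handlers": ["console", "file"]},
}

# Prod: stdout/stderr only; the platform picks these streams up.
PROD_CONFIG = {
    "version": 1,
    "handlers": {"console": {"class": "logging.StreamHandler"}},
    "root": {"level": "INFO", "handlers": ["console"]},
}

config = DEV_CONFIG if os.environ.get("FLASK_DEBUG") == "1" else PROD_CONFIG
logging.config.dictConfig(config)
```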

Successfully merging this pull request may close these issues.

Add logging to worker