
Features/add logging #60

Open · wants to merge 44 commits into dev from features/add_logging
Conversation

MaGering (Collaborator) commented Mar 13, 2023

Fixes #21.

With this PR logging is added to the pipeline.

Before merging into the dev branch, please make sure that:

  • if data flow was adjusted: data pipeline run finishes successfully.
  • new and adjusted code is formatted using black and isort.
  • the CHANGELOG.rst was updated.
  • the docs were updated.

@MaGering MaGering self-assigned this Mar 13, 2023
@MaGering MaGering force-pushed the features/add_logging branch from f5e1a3b to fccb356 Compare March 27, 2023 15:46
@MaGering MaGering marked this pull request as ready for review March 28, 2023 09:40
@MaGering MaGering requested a review from nesnoj March 28, 2023 09:40
MaGering (Collaborator, Author) commented:

Logging is ready! :)
I was wondering whether I should adapt mastr.py so that logging is used where we currently have print calls, for instance here. I would need to pass the logger to the respective function.

nesnoj (Member) left a comment

Thanks for this feature including all different ways of running tasks - works well so far! 😸

Some thoughts from my side:

  • I like the idea of putting the log into the dataset's folder, where it belongs. However, I find it a bit messy when it is saved to the data folder: the user wants to see the data quite often, whereas the log is only checked if something goes wrong. What about a separate log folder?
  • I'm a bit curious about the file name. You use the actual file's name (without extension), but what if there are multiple files?
    What about log/<DATASET_NAME>.log (in the folder I proposed above)? Or just something like log/dataset.log, which would save us implementing an additional function to obtain the dataset's name. We'd have DATASET_PATH / "log" / "dataset.log" everywhere, which is sufficient to my mind. Are you fine with that?
    • Sidenote: If we use wildcards and 10 jobs are working on 10 parallel tasks of the same dataset, everything will be logged to the same file, but I propose not to tackle this now.
  • Currently there's no logging of Linux shell commands' output. That can easily be achieved by redirecting the standard output (stdout) and standard error (stderr) streams by adding > {log} 2>&1 (it's also described in the snakemake docs). I can do this.
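The redirection from the last bullet could look like this in a rule (file paths and the command are placeholders, not from the actual snakefiles; note the order: the file redirection must come before 2>&1 so that stderr follows stdout into the log file):

```snakemake
rule create:
    input: "raw/dataset.gpkg"
    output: "data/dataset.gpkg"
    log: "log/dataset.log"
    shell: "some_command {input} {output} > {log} 2>&1"
```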

One more thing: I'm getting all output twice at the moment (regular out from snakemake and additionally from the logger):

[Screenshot: the same log lines appear twice in the console output]

Could you suppress snakemake's log output? Or the output from our logger? Ours is richer than snakemake's, though, so I'd prefer to see ours.
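One plausible cause of such duplication, sketched below, is that records from the pipeline's logger also propagate to the root logger, where another handler (e.g. snakemake's) prints them a second time; disabling propagation keeps only our handler's output. The logger names and setup here are illustrative, not the project's actual configuration:

```python
import io
import logging

# Root logger with its own handler (standing in for snakemake's handler).
root_stream = io.StringIO()
logging.basicConfig(stream=root_stream, level=logging.INFO, force=True)

# Our pipeline logger with a separate, richer handler.
our_stream = io.StringIO()
logger = logging.getLogger("digipipe_demo")
handler = logging.StreamHandler(our_stream)
handler.setFormatter(logging.Formatter("%(asctime)s - %(name)s - %(message)s"))
logger.addHandler(handler)
logger.setLevel(logging.INFO)

logger.info("duplicated")   # emitted by our handler AND the root handler
logger.propagate = False    # stop records from bubbling up to the root
logger.info("only ours")    # emitted by our handler only
```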

nesnoj (Member) commented Apr 20, 2023

Logging is ready! :) I was wondering whether I should adapt mastr.py so that logging is used where we currently have print calls, for instance here. I would need to pass the logger to the respective function.

Oh, I missed this one. Yes, that makes much sense!

nesnoj (Member) commented Apr 20, 2023

One more question:
In snakefiles where the task is called via python <script>, you pass the log file. Example:
https://github.com/rl-institut-private/digipipe/blob/c19ceb2c7162809c745a6d7cbf05843e691269f7/digipipe/store/datasets/bkg_vg250_region/create.smk#L20-L23
But you do not actually use the 4th argument:
https://github.com/rl-institut-private/digipipe/blob/c19ceb2c7162809c745a6d7cbf05843e691269f7/digipipe/store/datasets/bkg_vg250_region/scripts/create.py#L33-L37
How does it work out that the log is written to the right file?

MaGering (Collaborator, Author) commented:

  • I like the idea of putting the log into the dataset's folder, where it belongs. However, I find it a bit messy when it is saved to the data folder: the user wants to see the data quite often, whereas the log is only checked if something goes wrong. What about a separate log folder?

  • I'm a bit curious about the file name. You use the actual file's name (without extension), but what if there are multiple files?
    What about log/<DATASET_NAME>.log (in the folder I proposed above)? Or just something like log/dataset.log, which would save us implementing an additional function to obtain the dataset's name. We'd have DATASET_PATH / "log" / "dataset.log" everywhere, which is sufficient to my mind. Are you fine with that?

    • Sidenote: If we use wildcards and 10 jobs are working on 10 parallel tasks of the same dataset, everything will be logged to the same file, but I propose not to tackle this now.

Thanks for your thoughts! I like the idea of putting the log output into a separate directory! Since I wanted to keep the name of the data file in the log file, I decided to go with individual log files in the directory '.log' instead of the '.log/dataset.log' variant. The latter would require more adjustment and time to retain the name of the corresponding dataset folder. Are you fine with that?
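The per-file scheme described here could be captured in a small helper like the following (the function name and exact layout are assumptions based on this thread, not the actual implementation):

```python
from pathlib import Path

def log_path_for(dataset_path, data_file):
    """Per-file log: <dataset path>/.log/<data file stem>.log (scheme assumed)."""
    return Path(dataset_path) / ".log" / (Path(data_file).stem + ".log")
```

For example, log_path_for("store/datasets/bkg_vg250_region", "vg250.gpkg") yields store/datasets/bkg_vg250_region/.log/vg250.log, so the data file's name survives in the log file while the data folder itself stays uncluttered.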

MaGering (Collaborator, Author) commented:

That would be awesome! Thank you. :)

MaGering (Collaborator, Author) commented:

Could you suppress snakemake's log output? Or the output from our logger? Ours is richer than snakemake's, though, so I'd prefer to see ours.

I am quite at a loss as to how to silence this. Most of the entries disappeared with commit a0a0676. Unfortunately I can't get rid of the ones you describe. I only found the --quiet option for this, but I can't figure out how to build it into the pipeline. Could you tell me how to do this, @nesnoj?

Successfully merging this pull request may close these issues: Add logging

2 participants