ENH/REF: Enable logging through multiprocessing, refactor node work signatures #41
Conversation
…ing from multiprocessing.Process
…n.Status, normalize logging in ActionNode/ConditionNode, adjust affected tests
… more explicit and clear about what these work functions require as arguments
Asking for preliminary reviews as I vomit words into the PR description. (Please unfold the details sections; I spent a lot of time on them.)
Hmm, this looks great and I see tests are passing on CI. However, on my machine test_father_tree_execution fails only when the two other tests in the file run. If I recall, we had this bug before, which is strange. I will try to debug.
I seem to recall that being a caproto bug? (Or something related to how we're running the caproto IOCs in the suite?) Something about IOCs not being cleaned up? Both on CI here and on my local machine I can run the full test suite without problems, strange 🤔
beams/behavior_tree/ActionNode.py
Outdated
while not self.completion_condition():
    logger.debug(f"CALLING CAGET FROM from node ({self.name})")
    status = self.work_func(self.completion_condition)
    volatile_status.set_value(status)
    logger.debug(f"Setting node ({self.name}): {volatile_status.get_value()}")
This while loop has no failure escape condition. It only ends if we error out and bounce out of the function entirely, or if the completion condition succeeds. Perhaps we should be expecting to catch an exception? Perhaps we should have a built-in timeout at this level?
This also means that `volatile_status` can never be permanently set to a failure state as written, since we'll never leave the loop without `completion_condition` being True and resetting `volatile_status` to a success state below.
You're absolutely right, though this has been the reality since before this PR. Josh's implementation of multi-shot trees provides one escape hatch for this, but I think breaking on exception is a good stop-gap here. I have some difficulty thinking of a generally suitable timeout / iteration-count limit.
For an action node, I think it does make sense to keep trying until we succeed. Maybe @joshc-slac can chime in here as to what would be appropriate, and whether we address this here or let the multi-shot tree handle it.
The other thing to consider is whether or not we could rely on the work function itself to update the status and end the loop based on that status. I'm sure there are other ways to think about this, though.
@ZLLentz thanks for the super valuable diligence. To those points:
- I want to add the ability for all these nodes to time out. This PR is a great step toward enabling that in a single location. I think there may be other cases in which we want to interrupt the while loop. Robert is right that #40 (Determine if and move work functions from the serialization document tree_config.py) allows program termination to clean these up; I might table that part of the discussion for Wednesday.
- @tangkong thanks to Zach's careful glance I too am realizing this would be a great place to implement benchmarking around the work function. We can leave it as a TODO, but it would be neat to return how long the actual work of this action node took.
- To Zach's final point... I could be convinced both ways; I am erring toward agreeing with Robert's implementation here, mainly for the purpose of keeping IPC things at this layer. Your suggestion is elegant as well, though.
For now I'll just add an exception catch and we can expand on the behavior in another PR. I want to keep the contributions here as logging-focused as possible; I've submitted too many multi-faceted PRs, haha.
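Roughly something like this (a sketch using the names from the snippet above — `work_func`, `completion_condition`, `volatile_status` — with `Status` from `py_trees.common`; the helper name and exact failure handling are placeholders):

```python
import logging

from py_trees.common import Status

logger = logging.getLogger(__name__)


def run_work_loop(name, work_func, completion_condition, volatile_status):
    """Hypothetical loop body: keep working until the condition passes, bail out on error."""
    while not completion_condition():
        try:
            status = work_func(completion_condition)
        except Exception:
            # Any exception permanently marks the node as FAILURE and ends the loop
            logger.exception("Work function for node (%s) raised, marking FAILURE", name)
            volatile_status.set_value(Status.FAILURE)
            return
        volatile_status.set_value(status)
        logger.debug("Setting node (%s): %s", name, volatile_status.get_value())
```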
beams/behavior_tree/ActionNode.py
Outdated
    work_func=self.work_wrapper,
    comp_cond=completion_condition,
    stop_func=None
)  # TODO: some standard notion of stop function could be valuable
self.logger.debug("%s.__init__()" % (self.__class__.__name__))
There are some inconsistencies here where sometimes we use `self.logger` and other times we use `logger`. Is there some intuitive rule for when to use each one? Do some names need to change to make this clear?
Whoops, I forgot I also need to switch out `self.logger`. `self.logger` invokes py_trees' logging facilities, which I haven't addressed in this PR. That should definitely be a follow-up.
I advocate for using `logger = logging.getLogger(__name__)` (which will give you a child logger of the `beams` logger) throughout our library code.
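For illustration, the pattern I'm advocating looks like this (a sketch; the module path in the comment is just an example):

```python
# e.g. in beams/behavior_tree/ActionNode.py
import logging

# __name__ resolves to something like "beams.behavior_tree.ActionNode",
# so this is a child of the "beams" logger and inherits its handlers.
logger = logging.getLogger(__name__)

logger.debug("records propagate up to the handlers configured on 'beams'")
```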
""" | ||
log_configurer(log_queue) | ||
logger.debug(f"WAITING FOR INIT from node: {self.name}") | ||
self.work_gate.wait() |
Is `self` meant to be accessed from within the `work_wrapper`? Is this running in another process? How is the data shared?
We pass `work_wrapper` through the `multiprocessing.Process`, in which `self` will not be the same as the `ActionNode` in the main process. It's possible that because `work_gate` is a process-safe `Event`, accessing and sharing it here is ok. It is probably clearer for the `work_gate` to be passed through as an arg in the `ActionWorker`, as you note.
I'll spend some time next week (or this weekend if I get bored) getting a better understanding of this
I did understand this correctly: when we pass an instance method to the process, the instance gets pickled and passed along with it. We can make no guarantees about any non-process-safe member of that instance, but the `Value`/`Event`/`Gate` objects will be synchronized across processes.
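A small standalone example of that behavior (a sketch, not our actual code; `Node`, `counter`, and `work_gate` are illustrative names): the plain attribute diverges between processes, while the `multiprocessing.Event` stays synchronized.

```python
import multiprocessing as mp


class Node:
    def __init__(self):
        self.counter = 0             # plain attribute: copied, not shared
        self.work_gate = mp.Event()  # process-safe: shared with the child

    def work(self):
        self.counter += 1            # only mutates the child's copy
        self.work_gate.set()         # visible to the parent as well


if __name__ == "__main__":
    node = Node()
    proc = mp.Process(target=node.work)  # `node` travels to the child with the bound method
    proc.start()
    proc.join()
    print(node.counter)             # 0: the parent's copy never changed
    print(node.work_gate.is_set())  # True: the Event is synchronized
```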
Good to know!
import yaml

LOGGER_QUEUE = mp.Queue(-1)
I can't find a documentation reference to what `-1` as `maxsize` does. What does it do? The maximum finite max size, maybe? The docs I can find indicate that leaving this unset is how you get an unbounded queue.
It does give you an infinite-size queue (the closest docs are https://docs.python.org/3/library/queue.html#queue.Queue, from the implication that `mp.Queue` implements most methods of `queue.Queue`). Using `-1` here should be equivalent to setting `maxsize=0` or leaving it unset.
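For reference, a quick sketch of that equivalence (leaning on `queue.Queue`'s documented rule that a `maxsize` of zero or less means an infinite queue):

```python
import multiprocessing as mp

# All three should behave the same for our purposes: an unbounded queue.
q_default = mp.Queue()        # maxsize defaults to 0
q_zero = mp.Queue(maxsize=0)  # explicitly "infinite"
q_negative = mp.Queue(-1)     # anything <= 0 is treated the same way
```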
Queue that passes logging records between processes
"""
h = QueueHandler(queue)  # Just the one handler needed
root = logging.getLogger("beams")  # root logger for beams
Does this mean that workers will only log messages that originate from beams itself?
This is correct (as in, this is the intention of the code written here).
The logging config actually currently assigns both handlers to the root logger as well, which I did not initially intend. If we want to, we can make this the root logger and just pass through all logging messages we receive.
I think general loggers can be noisy enough that this makes sense as-is. Maybe we can consider a more verbose mode in the future if needed.
LOGGER_THREAD = threading.Thread(
    target=logger_thread, args=(LOGGER_QUEUE,), daemon=True
)
LOGGER_THREAD.start()
Does the main thread also send log messages to the queue, or does the main thread handle these differently?
The main thread logs messages normally, without the `QueueHandler`. The `QueueHandler` is only necessary to liberate worker logs from their multiprocessing prison.
Here, "normally" just means there are no additional handlers intercepting the log message before it gets processed by the handlers we define in `logging.yml` for file and stream logging.
I got brain-crossed between threads and multiprocessing again; this is a good way to do it.
I asked a lot of questions, but I like the approach you've taken to logging here.
I'm going over this with a fine-tooth comb to understand why my tests are failing locally (again only locally, in the old caproto failure mode where they pass when only one caproto test is run). This may indicate some weirdness with caproto or, more concerningly, some weirdness with the library such that it is hardware dependent... Pedantic other findings from that here:
I'm gonna leave this here before I forget, because I like the logging.
In response to @joshc-slac's more specific addition requests, I think there are a lot of places where we can flesh out logging messages. I might rather tackle those in a separate PR or in PRs that touch that code specifically, mostly because I think it's a slippery slope toward making the diff in this PR cover the entire codebase. I do agree with all of the suggestions; I just want to keep this PR focused. 🙏
This is great. I'm going to stop driving myself insane about this caproto pytest issue on my end.
Update: I'm silly, I had a rogue caproto process from a test I'd ctrl-C'ed out of. This also would have been resolved if I rebooted my machine more frequently. Warning for future silly folks.
More importantly, great logging PR, thanks!
Update update: this does seem real, but we can move forward and track it in #42.
# TODO: we may want to decorate work func so it prints proc id...
if (work_func is None):
    self.work_proc = proc_type(target=self.work_func, name=self.proc_name)
else:
    self.work_func = work_func
    self.work_proc = proc_type(target=self.work_func, name=self.proc_name, args=(self, *add_args,))
Okay, not passing `self` here anymore somewhat fundamentally breaks #38.
The point was to get access to the `self.do_work` Value. This is fine for this PR, and #38 is now rebased on top of this PR. It may come back, or we may pass that one value more intentionally...
Okay, I fixed this on a new branch based on this PR. This now closes #38.
This is in a similar vein to a thread from earlier in this PR. I did expect that this PR and #38 would conflict a bit, particularly in this refactor. I'm leaning toward not accessing any `self` member in the functions we pass through to `Process`es, and instead passing the process-safe events/values/etc. in explicitly. Naming an argument "self" and passing a different object into that argument is particularly confusing from a Python-convention point of view.
But what I really liked from this refactor was the cleaner `work_func` signature, so I could be convinced either way w.r.t. this.
Thanks for the reviews everyone! Merging!
Description
- Adds a logging framework to allow workers in `multiprocessing.Process` to log to a central logger
- Refactors `ActionNode` work function signatures to expect a `py_trees.common.Status` to be returned. This allows us to do the common nitty-gritty work without exposing it to the user.
- Adjusts tests that are affected by these changes. Also makes the tests more pytest-y.
- Removes the ability for `Worker` to "bind" methods to itself (this never worked properly in the first place)

Example Output
`pytest test_check_and_do.py -s`
An external script
The Script
The Output
Example file outputs
Motivation and Context
More info to come here... as well as how-tos on actually enabling logging.
Logging through multiprocessing
Prepare for a bit of a diatribe, as I don't want my struggles to be confined to my own notes forever.
Logging naturally works well within a single Python process and normal `threading.Thread`s. Once we start wanting to log events in spawned `multiprocessing.Process`es, we start to worry about serializing access to a single file/resource/stream across multiple Python processes. Here in BEAMS we'll likely want to log to multiple locations, not just to the console but to files, GUI outputs, etc.

The simplest way to do this is to pass logging records through a `multiprocessing.Queue` and have a central logging process handle those records as they're received. This logging process could also be in a separate `multiprocessing.Process`, but placing it in a daemon thread lets us start it and forget about it.

Sources:
https://docs.python.org/3/howto/logging-cookbook.html#logging-from-multiple-threads
https://github.com/pcdshub/hutch-python/blob/master/hutch_python/log_setup.py
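To make the pattern concrete, here is a minimal self-contained sketch (not the BEAMS code itself; `worker`, `logger_thread`, and the messages are illustrative) of the cookbook approach: each worker process attaches a `QueueHandler`, and a daemon thread in the main process drains the queue into the normally configured handlers.

```python
import logging
import multiprocessing as mp
import threading
from logging.handlers import QueueHandler


def worker(log_queue):
    """Runs in a child process: push every log record onto the shared queue."""
    root = logging.getLogger()
    root.setLevel(logging.DEBUG)
    root.addHandler(QueueHandler(log_queue))
    logging.getLogger("beams.worker").info("hello from pid %s", mp.current_process().pid)


def logger_thread(log_queue):
    """Runs as a daemon thread in the main process: drain records into the real handlers."""
    while True:
        record = log_queue.get()
        if record is None:  # sentinel for a clean shutdown
            break
        logging.getLogger(record.name).handle(record)


if __name__ == "__main__":
    logging.basicConfig(level=logging.DEBUG)  # stand-in for the logging.yml config
    log_queue = mp.Queue(-1)
    threading.Thread(target=logger_thread, args=(log_queue,), daemon=True).start()

    procs = [mp.Process(target=worker, args=(log_queue,)) for _ in range(2)]
    for proc in procs:
        proc.start()
    for proc in procs:
        proc.join()
    log_queue.put(None)  # stop the draining thread
```

Because the records are only formatted and emitted in the main process, a single logging configuration governs the output of every worker.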
ActionNode Work function fragmentation
This necessitated some changes to how we structure the work functions used by `ActionNode`. We were doing a lot of dirty-work boilerplate in the "work_func", and I thought it sensible to separate that dirty work from the "business logic". This resulted in:

- a `work_func` that returns a `py_trees.common.Status`: the "business logic"
- a `work_wrapper` that sets up the surrounding dirty work

Naming here could be improved, but that's an issue for a different effort.
I'd argue that if and when the exact form of these `work_wrapper` functions varies, those differences should be crystallized in subclasses of `ActionNode`. This way the signatures are clear and documented (in code, for now).
. This way the signatures are clear and documented (in code for now)I'd also argue that the
work_func
signature should be codified somewhere. The work wrapper currently expects it to take the Callablecompletion_condition
, but that's really a Check-and-Do specific formulation. We could potentially scope this so work func is always aCallable[[], Status]
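As a sketch of what that codification could look like (the alias names are hypothetical; only the Check-and-Do form matches what the wrapper currently expects):

```python
from typing import Callable

from py_trees.common import Status

# What the work wrapper expects today: the work function receives the
# completion condition so it can decide how to keep trying (Check-and-Do).
CheckAndDoWorkFunc = Callable[[Callable[[], bool]], Status]

# The more general scoping floated above: do one unit of work, report status.
GeneralWorkFunc = Callable[[], Status]
```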
The removal of `Worker.set_work_func`
`Worker.set_work_func` used to assign a function to `Worker.work_func`. There has historically been much confusion over "self", when it's needed, and when we have to provide it in additional arguments.

Dynamically binding methods to class instances is bad practice to begin with. A class exists to capture a structure and a consistent interface through which other parts of the program can interact. By dynamically changing this, we confuse ourselves and other developers.
We already use `work_func` in two ways. I argue this third one should be removed, as the other ways of assigning work to the `Worker` are more than flexible enough.
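For illustration, the two remaining ways look roughly like this (a minimal stand-in sketch, not the actual `Worker` API; the class and function names here are made up):

```python
import multiprocessing as mp


class Worker:
    """Minimal stand-in for the repo's Worker, just to show the two patterns."""

    def __init__(self, work_func=None):
        if work_func is not None:
            # 2) Work passed in at construction time as a standalone function.
            self.work_func = work_func
        # 1) Otherwise, work_func is whatever the subclass defined.

    def start(self):
        proc = mp.Process(target=self.work_func)
        proc.start()
        return proc

    def work_func(self):
        raise NotImplementedError("subclass me or pass work_func in")


class PrintingWorker(Worker):
    def work_func(self):
        print("work defined on the subclass")


def standalone_work():
    print("work passed in at construction time")


if __name__ == "__main__":
    for worker in (PrintingWorker(), Worker(work_func=standalone_work)):
        worker.start().join()
```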
How Has This Been Tested?
Interactively, and through tests.
Tests have been adjusted where needed.
Where Has This Been Documented?
This PR; more docstrings to come.
Pre-merge checklist
- Ran `docs/pre-release-notes.sh` and created a pre-release documentation page