extractor_dict_iterator for solving path detection in object `kwargs` #3089

h-mayorquin · 2024-06-26T23:03:13Z

Extractor dicts (outputs of BaseExtractor.to_dict()) are nested structures where some arguments are paths.

For solving the problem in #3013 we need a way of filtering by the arguments that correspond to real paths (e.g. Path(argument).exists() is True) and only modify those.

The current approach is to use the recursive_path_modifier:
https://github.com/alejoe91/spikeinterface/blob/fa2ca6d7c804fc8ab3df93dbd463d9576bc7723b/src/spikeinterface/core/core_tools.py#L186

I guess at the beginning the purpose of the function was both iterating overt the dict and also modifying the values.
But then, some other functions appeared like get_path_list that use recursive_path_modifier to iterate over the extractor and accumulate path-like values.

This PR creates two functions that can accomplish the same as recursive_path_modifier but are decoupled:

First, it introduces extractor_dict_iterator that generates an iterator over all elements with three attributes: value, name and access path.
Second, a new set_value_in_extractor_dict function that uses the access path of the elements from the iterator to modify the value is also introduced.

With this logic, functions like find_recording_folders that only use the first part can avoid the part of recursive_path_modifier that modifies the values. Avoiding computational waste and simplifying intent. Moreover,, the problem in #3014 of having to check for path existence before or after a modification can be easily stated as a single line where elements are filtered instead of modifying the logic of recursive_path_modifier. In general it enables working with extractor dicts in the folowing way:

Collect items from the extractor dict
Filter to select some of them
Modify them

Note that this does not even need to be about paths. Any type of filter, reduce or concistency check and modification can be implemented with this.

I also modified the tests of make_relative_paths and make_absolute_paths to work on real paths so we can directly test the type of problem causes #3013

for more information, see https://pre-commit.ci

alejoe91

@h-mayorquin I love this! just some renaming suggestions

src/spikeinterface/core/core_tools.py

alejoe91 · 2024-06-27T07:03:21Z

@h-mayorquin this should cloe #3014 right?

…ator' into recording_dict_iterator

h-mayorquin · 2024-06-27T14:41:13Z

@alejoe91 did the correction and yes, this should close #3041

alejoe91 · 2024-06-27T14:47:00Z

@h-mayorquin can you make the previous test function in test_core_tools (test_path_utils_functions) also to use pytest.skip?

h-mayorquin · 2024-06-27T15:01:37Z

Yes, done.

alejoe91 · 2024-06-27T15:07:21Z

This looks good to me. Much much cleaner than the old presentation!

@samuelgarcia what do you think?

samuelgarcia · 2024-06-28T10:17:06Z

I guess at the beginning the purpose of the function was both iterating overt the dict and also modifying the values. But then, some other functions appeared like get_path_list that use recursive_path_modifier to iterate over the extractor and accumulate path-like values.

Excellent guess and analysis of the histical reason!
I wanted to do something similar to this PR this when impleneted _get_paths_list but was to lazy to refacor with the yield like you did.

Thanks a lot for this effort.

samuelgarcia · 2024-06-28T10:19:01Z

src/spikeinterface/core/core_tools.py

+        elif isinstance(dict_list_or_value, list):
+            for i, v in enumerate(dict_list_or_value):
+                yield from _extractor_dict_iterator(
+                    v, access_path + (i,), name=name


This is really smart to have the path both with key and index for list!!

samuelgarcia · 2024-06-28T10:23:20Z

src/spikeinterface/core/core_tools.py

@@ -183,6 +184,75 @@ def is_dict_extractor(d: dict) -> bool:
    return is_extractor


+extractor_dict_element = namedtuple(typename="extractor_dict_element", field_names=["value", "name", "access_path"])


very good idea!

h-mayorquin added 2 commits June 26, 2024 15:33

add recording iterator

a166e5a

add and fix tests

27a7c9a

h-mayorquin self-assigned this Jun 26, 2024

h-mayorquin requested a review from alejoe91 June 26, 2024 23:03

h-mayorquin added bug Something isn't working core Changes to core module labels Jun 26, 2024

naming

b3b85b2

h-mayorquin mentioned this pull request Jun 26, 2024

Only make existing paths relative #3014

Closed

1 task

h-mayorquin and others added 3 commits June 27, 2024 00:39

windows test remove inner conditional

d794c82

[pre-commit.ci] auto fixes from pre-commit.com hooks

c1e4eee

for more information, see https://pre-commit.ci

Merge branch 'main' into recording_dict_iterator

423f4bb

alejoe91 requested changes Jun 27, 2024

View reviewed changes

alejoe91 added this to the 0.101.0 milestone Jun 27, 2024

h-mayorquin added 2 commits June 27, 2024 08:07

Merge remote-tracking branch 'refs/remotes/origin/recording_dict_iter…

e1f5230

…ator' into recording_dict_iterator

@Alejo91 suggestion

2cc7199

make test skipif

3eee955

alejoe91 approved these changes Jun 27, 2024

View reviewed changes

h-mayorquin marked this pull request as ready for review June 27, 2024 16:20

samuelgarcia reviewed Jun 28, 2024

View reviewed changes

samuelgarcia merged commit 47dd371 into SpikeInterface:main Jun 28, 2024
17 checks passed

h-mayorquin deleted the recording_dict_iterator branch June 28, 2024 14:50

h-mayorquin mentioned this pull request Jun 28, 2024

electrical_series_path/unit_table_path are made relative with relative_to #3013

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

extractor_dict_iterator for solving path detection in object `kwargs` #3089

extractor_dict_iterator for solving path detection in object `kwargs` #3089

h-mayorquin commented Jun 26, 2024 •

edited

Loading

alejoe91 left a comment

alejoe91 commented Jun 27, 2024

h-mayorquin commented Jun 27, 2024

alejoe91 commented Jun 27, 2024

h-mayorquin commented Jun 27, 2024

alejoe91 commented Jun 27, 2024

samuelgarcia commented Jun 28, 2024

samuelgarcia Jun 28, 2024

samuelgarcia Jun 28, 2024

		@@ -183,6 +184,75 @@ def is_dict_extractor(d: dict) -> bool:
		return is_extractor


		extractor_dict_element = namedtuple(typename="extractor_dict_element", field_names=["value", "name", "access_path"])

extractor_dict_iterator for solving path detection in object kwargs #3089

extractor_dict_iterator for solving path detection in object kwargs #3089

Conversation

h-mayorquin commented Jun 26, 2024 • edited Loading

alejoe91 left a comment

Choose a reason for hiding this comment

alejoe91 commented Jun 27, 2024

h-mayorquin commented Jun 27, 2024

alejoe91 commented Jun 27, 2024

h-mayorquin commented Jun 27, 2024

alejoe91 commented Jun 27, 2024

samuelgarcia commented Jun 28, 2024

samuelgarcia Jun 28, 2024

Choose a reason for hiding this comment

samuelgarcia Jun 28, 2024

Choose a reason for hiding this comment

extractor_dict_iterator for solving path detection in object `kwargs` #3089

extractor_dict_iterator for solving path detection in object `kwargs` #3089

h-mayorquin commented Jun 26, 2024 •

edited

Loading