Add an option for count_num_spikes_per_unit #2209

samuelgarcia · 2023-11-15T13:07:33Z

This should be merge after #2198

Spike vector should be the default, as computing 1 spike train will mean the cache is available (but doesn't mean it's faster to use the cached spike trains)

…rface into count_spike_array

…_per_unit()

samuelgarcia · 2023-11-15T13:09:14Z

With this, some part of the code using count_num_spikes_per_unit() can be rewritten with array in mind and should be better than looping over a dict. (for instance using np.any, np.all, ...)

should I do this in the same PR ?

h-mayorquin · 2023-11-15T13:59:14Z

@samuelgarcia

should I do this in the same PR ?

I vote not, one PR per feature as much as we can.

h-mayorquin · 2023-11-15T14:01:11Z

src/spikeinterface/core/basesorting.py

@@ -269,35 +269,40 @@ def get_total_num_spikes(self):
        )
        return self.count_num_spikes_per_unit()

-    def count_num_spikes_per_unit(self) -> dict:
+    def count_num_spikes_per_unit(self, output="dict"):


I think it would be better to just make another function for this behavior.

What is the advantage of doing it with a keyword?
The disavatnages are:

Harder to discover

More complicated output type which makes harder to reason about code.

Adds complexity to the function.

Functions should do one thing and do it well. Specially at the core. Or make two private methods that do the specific task and make this a dispatcher with options.

For me the main advantage is internal use of sorting.count_num_spikes_per_unit(output="array")
replacing at several places the uggly np.array(list(sorting.count_num_spikes_per_unit().values()))

But you can just have another method that will allow you to do exactly the same:

sorting.count_num_spikes_per_unit_array instead of np.array(list(sorting.count_num_spikes_per_unit().values()))

I think the method is a good idea.
I think mixing this new logic in the old method is not for the reasons stated above.

I am asking: why do you want to mix the logic to produce counts as arrays and counts as dict in the same fuction? What's the advantage of it? Is than an aesthethical thing? You dislike having more functions? You personally find keywords easier to discover than functions? I am looking for your reasoning here.

"Functions should do only one thing and do it well" seems like a really good principle to me that is more or less widely accepted. Design rules can -and sometimes should- be broken, but I think we should have a good reason to do it. Which one is here?

propagate the outputs option at several place when it make sens.

samuelgarcia · 2023-11-17T10:49:39Z

@DradeAW : can you chekc this ?

src/spikeinterface/core/basesorting.py

DradeAW · 2023-11-17T10:56:18Z

I think the way you implemented it is the right way: using spike trains if they are all cached or if the spike_vector isn't cached.

Also love the idea of getting an array or a dict!
I never know the return type and often need to convert it depending on my needs, I like the option :)

I'll make some test to make sure it makes things faster on my code.

DradeAW · 2023-11-17T11:03:53Z

Yep I do have a noticeable speedup (several seconds)!

Thanks @samuelgarcia :)

zm711

Although I do think that separating out functions is likely better, I added docstring/error clarifications with the assumption that this PR will remain as is. Feel free to ignore if you plan to change the structure.

src/spikeinterface/core/basesorting.py

Co-authored-by: Zach McKenzie <[email protected]>

for more information, see https://pre-commit.ci

DradeAW and others added 4 commits November 13, 2023 15:42

Improvement when counting num spikes

65c5024

Spike vector should be the default, as computing 1 spike train will mean the cache is available (but doesn't mean it's faster to use the cached spike trains)

Merge branch 'main' into num_spikes_vector_fast

3601564

Merge branch 'num_spikes_vector_fast' of github.com:DradeAW/spikeinte…

02cd2ca

…rface into count_spike_array

Make optionally otuput="dict" or "array" for sorting.count_num_spikes…

3863f75

…_per_unit()

samuelgarcia mentioned this pull request Nov 15, 2023

Improvement when counting num spikes #2198

Closed

alejoe91 added enhancement New feature or request core Changes to core module labels Nov 15, 2023

h-mayorquin reviewed Nov 15, 2023

View reviewed changes

Improve count_num_spikes_per_unit() strategy speed.

ff9d0ba

propagate the outputs option at several place when it make sens.

DradeAW reviewed Nov 17, 2023

View reviewed changes

src/spikeinterface/core/basesorting.py Outdated Show resolved Hide resolved

zm711 reviewed Nov 17, 2023

View reviewed changes

src/spikeinterface/core/basesorting.py Outdated Show resolved Hide resolved

src/spikeinterface/core/basesorting.py Outdated Show resolved Hide resolved

src/spikeinterface/core/basesorting.py Outdated Show resolved Hide resolved

h-mayorquin mentioned this pull request Nov 18, 2023

Add spike-train based lazy SortingGenerator #2227

Merged

samuelgarcia mentioned this pull request Nov 20, 2023

Allow precomputing spike trains #2175

Merged

samuelgarcia and others added 4 commits November 22, 2023 07:53

Zach the typos killer

b5d870f

Co-authored-by: Zach McKenzie <[email protected]>

yep

712b1e0

Merge branch 'main' into count_spike_array

b09ee96

[pre-commit.ci] auto fixes from pre-commit.com hooks

a6bd539

for more information, see https://pre-commit.ci

alejoe91 approved these changes Nov 22, 2023

View reviewed changes

alejoe91 merged commit 5d7b64e into SpikeInterface:main Nov 22, 2023
11 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add an option for count_num_spikes_per_unit #2209

Add an option for count_num_spikes_per_unit #2209

samuelgarcia commented Nov 15, 2023

samuelgarcia commented Nov 15, 2023

h-mayorquin commented Nov 15, 2023

h-mayorquin Nov 15, 2023

samuelgarcia Nov 22, 2023

h-mayorquin Nov 22, 2023 •

edited

Loading

samuelgarcia commented Nov 17, 2023

DradeAW commented Nov 17, 2023

DradeAW commented Nov 17, 2023

zm711 left a comment

Add an option for count_num_spikes_per_unit #2209

Add an option for count_num_spikes_per_unit #2209

Conversation

samuelgarcia commented Nov 15, 2023

samuelgarcia commented Nov 15, 2023

h-mayorquin commented Nov 15, 2023

h-mayorquin Nov 15, 2023

Choose a reason for hiding this comment

samuelgarcia Nov 22, 2023

Choose a reason for hiding this comment

h-mayorquin Nov 22, 2023 • edited Loading

Choose a reason for hiding this comment

samuelgarcia commented Nov 17, 2023

DradeAW commented Nov 17, 2023

DradeAW commented Nov 17, 2023

zm711 left a comment

Choose a reason for hiding this comment

h-mayorquin Nov 22, 2023 •

edited

Loading