New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

feature: create roi selection #90

Open

ajonkman wants to merge 55 commits into main from 087_create_roi_selection_ajonkman

ajonkman commented Sep 1, 2023

Beginnings of ROI selection features

ajonkman and others added 30 commits

September 1, 2023 16:44


          Add ROISelection and FunctionalLungspace

f006d31


          Unfinished GridSelection algorithm

9328a98


          Fix unwanted redefinition of built-in variable

7ada94c


          Update GridSelection.find_grid() function

cd21b8c


          Add tests for GridSelection

4987a62


          Add function to GridSelection to get a layout of the returned matrices

26f6a91


          Add some scratchpad code to notebook

67c9c62


          Update region boundary finding algorithm

9bce610


          Improve error messages

65dc08f


          Expand tests for GridSelection

d6bf3e3

Tests now include inputs with np.nan values and values that are not 1.


          Convert GridSelection to a dataclass

d71da31


          Add module specific exceptions

274a35e


          Add value checks in __post_init__

c5958cd


          Move NotImplementedError to proper place

d7f2f00


          Update get_region_boundaries method with warnings

0d0dd2f


          Implement shorter unpacking

45b72d6


          Add tests for initialization, warnings and exceptions

33ab9ae


          Add tests for matrix_layout()

9b04994


          Create convenience classes of most-used cases

27b78b5


          Add docstring to GridSelection class

a8856e1


          Remove v_split and h_split arguments from convenience classes' __init__

27df399


           Add split pixels functionality

c86321e


          Extend, update and improve documentation


          Unify split pixels an non-split pixels function to reduce code reuse

bed66d5


          Update tests to reflect split pixels and unification

cfa97ce


          Create module-specific warnings for catching in a GUI

bb65617


          Update documentation, removing True/False values

37b3c9b


          Add warnings for when there are more groups than vectors

451880e


          Allow orientation specific splitting

6a4aa95

This allows e.g. vertical regions to be split by row, by horizontal regions not be split by column.


          Add tests for when split_pixels is True

7a7d0d5

psomhorst added 7 commits

October 11, 2023 09:06


          Rename split_cols to split_columns

1434f7c


          Add default split arguments to convenience classes

a713a85


          Re-add checks for split arguments

b8faa91


          Add option to ignore or include nan rows/columns

6e86046


          Convert 2D ndarray to list of 1D arrays

b33bc86


          Make split/no-split methods more alike

4913ae5


          Simplify matrix creation

170ceea

psomhorst force-pushed the 087_create_roi_selection_ajonkman branch from 1045834 to 170ceea Compare

October 11, 2023 07:06

psomhorst added 5 commits

October 11, 2023 13:40


          Fix swapped split_rows / split_columns

ba0137f


          Reduce code repetition

7aaefce


          Add tests for initialization with split_rows/columns and ignore_nan_r…

337a9e3

…ows/columns


          Add tests for exceptions with(out) split_rows/columns

0681d55


          Improve documentation

a191c6c

psomhorst requested a review from DaniBodor

October 11, 2023 11:52

psomhorst self-assigned this

psomhorst added the enhancement label

psomhorst added this to the Create example classes for pre-processing and analysis milestone

psomhorst marked this pull request as ready for review

October 11, 2023 11:53

Contributor

psomhorst commented Oct 11, 2023 •

edited

Loading

@DaniBodor @ajonkman This feature is now ready for review. It has become vastly more complicated than I first intended due to these factors:

there should be a way to split a row or column between two regions, if otherwise regions would consist of a different number of rows/columns;
it was pointed out that splitting rows/columns should be set individually, so you can e.g. split the left/right without splitting a column, but then split ventral/dorsal with splitting rows;
in some cases, you want to ignore rows/columns that entirely consists of NaN (not a number) values, while in other you don't want to do that;
ignoring NaN values should also be set for rows and columns individually.

These features have now been built in.

I'm not entirely sure this API will work best, yet. I think it might be better to save the matrices to the object itself when calling find_grid(), and have a function to do the actual splitting of data into regions as well. But that concern can be addressed later. Maybe this API will just work.


          Linting

625d481

Contributor

psomhorst commented Oct 11, 2023 •

edited

Loading

TODO

Write tests where ignore_nan_columns and/or ignore_nan_rows is/are False.

psomhorst added 7 commits

October 27, 2023 11:26


          Add test for split pixels and NaN values, improve test documentation.

737d111


          Improve type hints for numpy arrays.

c0a66eb


          Use np.outer for row/column vector multiplication.

8369ce5

np.outer(a, b) is the same as a[:, np.newaxis] @ b[np.newaxis, :], when
a and b are 1D arrays.


          Use np.newaxis where appropriate.

b4d6c5c

np.newaxis is an alias of None, but more clear when reading the code.


          Use dictionary for choosing method, keeping it DRY.

a712e69


          Remove superfluous f-string indicators.

ad1a09b


          Remove unfinished FunctionalLungSpace selection class

af172d7

This class has been moved to its own branch 115_functionallungspace_psomhorst

psomhorst force-pushed the 087_create_roi_selection_ajonkman branch from 8e7a244 to af172d7 Compare

October 27, 2023 11:54

DaniBodor requested changes

View reviewed changes

Member

DaniBodor left a comment •

edited

Loading

I still need to look carefully at the tests, but thought I'd already open these comments/suggestions up anyway, as #119 has higher priority right now.

eitprocessing/roi_selection/gridselection.py

Comment on lines +16 to +22

+                  GridSelection allows for the creation a list of 2D arrays that can be used to divide a two- or
+                  higher-dimensional array into several regions structured in a grid. An instance of
+                  GridSelection contains information about how to subdivide an input matrix. Calling
+                  `find_grid(data)`, where data is a 2D array, results in a list of arrays with the same
+                  dimension as `data`, each representing a single region. Each resulting 2D array contains the
+                  value 0 for pixels that do not belong to the region, and the value 1 or any number between 0
+                  and 1 for pixels that (partly) belong to the region.

Member

DaniBodor Nov 8, 2023

Part of this is a rather generic explanation of what an ROI is/how we define it and handle it. Should that part of this docstring perhaps move to the docstring of the parent class?

also, see typo below:

Suggested change

      
                GridSelection allows for the creation a list of 2D arrays that can be used to divide a two- or
          
                higher-dimensional array into several regions structured in a grid. An instance of
          
                GridSelection contains information about how to subdivide an input matrix. Calling
          
                `find_grid(data)`, where data is a 2D array, results in a list of arrays with the same
          
                dimension as `data`, each representing a single region. Each resulting 2D array contains the
          
                value 0 for pixels that do not belong to the region, and the value 1 or any number between 0
          
                and 1 for pixels that (partly) belong to the region.
          
                GridSelection allows for the creation of a list of 2D arrays that can be used to divide a two- or
          
                higher-dimensional array into several regions structured in a grid. An instance of
          
                GridSelection contains information about how to subdivide an input matrix. Calling
          
                `find_grid(data)`, where data is a 2D array, results in a list of arrays with the same
          
                dimension as `data`, each representing a single region. Each resulting 2D array contains the
          
                value 0 for pixels that do not belong to the region, and the value 1 or any number between 0
          
                and 1 for pixels that (partly) belong to the region.

eitprocessing/roi_selection/gridselection.py

Comment on lines +29 to +41

+                  If the number of rows or columns can not split evenly, a row or column can be split among two
+                  regions. This behaviour is controlled by `split_rows` and `split_columns`.
+                  If `split_rows` is `False` (default), rows will not be split between two groups. A warning will
+                  be shown stating regions don't contain equal numbers of rows. The regions towards the top will
+                  be larger. E.g., when a (5, 2) array is split in two vertical regions, the first region will
+                  contain the first three rows, and the second region the last two rows.
+                  If `split_rows` is `True`, e.g. a (5, 2) array that is split in two vertical regions, the first
+                  region will contain the first two rows and half of each pixel of the third row. The second
+                  region contains half of each pixel in the third row, and the last two rows.
+                  `split_columns` has the same effect on columns as `split_rows` has on rows.

Member

DaniBodor Nov 8, 2023

would it be clearer to add this to "Args" or examples section of the docstring instead? Not sure if that is more or less clear, tbh.

eitprocessing/roi_selection/gridselection.py

Comment on lines +43 to +44

		Regions are ordered according to C indexing order. The `matrix_layout()` method provides a map
		showing how the regions are ordered.

Member

DaniBodor Nov 8, 2023

Unclear what "C indexing order" is. Maybe refer to example below?
I also rephrased slightly to make it clearer that the method refers to the same thing.

Suggested change

      
                Regions are ordered according to C indexing order. The `matrix_layout()` method provides a map
          
                showing how the regions are ordered.
          
                Regions are ordered according to C indexing order, and a map of this ordering can be produced
          
                using the `matrix_layout()` method (see example below).

eitprocessing/roi_selection/gridselection.py

Comment on lines +66 to +90

+                      >>> gs = GridSelection(3, 1, split_pixels=False)
+                      >>> matrices = gs.find_grid(pixel_map)
+                      >>> matrices[0] * pixel_map
+                      array([[1, 2, 3],
+                             [4, 5, 6],
+                             [0, 0, 0],
+                             [0, 0, 0],
+                             [0, 0, 0],
+                             [0, 0, 0]])
+                      >>> gs.matrix_layout()
+                      array([[0],
+                             [1],
+                             [2]])
+                      >>> gs2 = GridSelection(2, 2, split_pixels=True)
+                      >>> matrices2 = gs.find_grid(pixel_map)
+                      >>> gs2.matrix_layout()
+                      array([[0, 1],
+                             [2, 3]])
+                      >>> matrices2[2]
+                      array([[0. , 0. , 0. ],
+                             [0. , 0. , 0. ],
+                             [0. , 0. , 0. ],
+                             [1. , 0.5, 0. ],
+                             [1. , 0.5, 0. ],
+                             [1. , 0.5, 0. ]])

Member

DaniBodor Nov 8, 2023

Few suggestions (all collapsed into one below), see also 4f78ebb:

Correct typo of split_pixels.
rename matrices -> rois
move matrix layout above the resulting map
add one more example to each gs, for added clarity.

Suggested change

      
                    >>> gs = GridSelection(3, 1, split_pixels=False)
          
                    >>> matrices = gs.find_grid(pixel_map)
          
                    >>> matrices[0] * pixel_map
          
                    array([[1, 2, 3],
          
                           [4, 5, 6],
          
                           [0, 0, 0],
          
                           [0, 0, 0],
          
                           [0, 0, 0],
          
                           [0, 0, 0]])
          
                    >>> gs.matrix_layout()
          
                    array([[0],
          
                           [1],
          
                           [2]])
          
                    >>> gs2 = GridSelection(2, 2, split_pixels=True)
          
                    >>> matrices2 = gs.find_grid(pixel_map)
          
                    >>> gs2.matrix_layout()
          
                    array([[0, 1],
          
                           [2, 3]])
          
                    >>> matrices2[2]
          
                    array([[0. , 0. , 0. ],
          
                           [0. , 0. , 0. ],
          
                           [0. , 0. , 0. ],
          
                           [1. , 0.5, 0. ],
          
                           [1. , 0.5, 0. ],
          
                           [1. , 0.5, 0. ]])
          
                    >>> gs = GridSelection(3, 1, split_rows=False)
          
                    >>> rois = gs.find_grid(pixel_map)
          
                    >>> gs.matrix_layout()
          
                    array([[0],
          
                           [1],
          
                           [2]])
          
                    >>> rois[0] * pixel_map
          
                    array([[1, 2, 3],
          
                           [4, 5, 6],
          
                           [0, 0, 0],
          
                           [0, 0, 0],
          
                           [0, 0, 0],
          
                           [0, 0, 0]])
          
                    >>> rois[1] * pixel_map
          
                    array([[0, 0, 0],
          
                           [0, 0, 0],
          
                           [7, 8, 9],
          
                           [10, 11, 12],
          
                           [0, 0, 0],
          
                           [0, 0, 0]])
          
                    >>> gs2 = GridSelection(2, 2, split_columns=True)
          
                    >>> rois2 = gs.find_grid(pixel_map)
          
                    >>> gs2.matrix_layout()
          
                    array([[0, 1],
          
                           [2, 3]])
          
                    >>> rois2[2]
          
                    array([[0. , 0. , 0. ],
          
                           [0. , 0. , 0. ],
          
                           [0. , 0. , 0. ],
          
                           [1. , 0.5, 0. ],
          
                           [1. , 0.5, 0. ],
          
                           [1. , 0.5, 0. ]])
          
                    >>> rois2[3]
          
                    array([[0. , 0. , 0. ],
          
                           [0. , 0. , 0. ],
          
                           [0. , 0. , 0. ],
          
                           [0. , 0.5, 1. ],
          
                           [0. , 0.5, 1. ],
          
                           [0. , 0.5, 1. ]])

eitprocessing/roi_selection/gridselection.py

Comment on lines +108 to +110

+                  def __post_init__(self):
+                      self._check_attribute_type("v_split", int)
+                      self._check_attribute_type("h_split", int)

Member

DaniBodor Nov 8, 2023

Why are we so strict on using the correct types beyond giving type hints?
I don't see the point in raising an error if someone passes 2.0 or something. This could happen if some external package uses floats by default for whatever reason.

If we do want to remain strict, see 62bc8d4 for a suggestion of how to simplify coding this.

eitprocessing/roi_selection/gridselection.py

+                      split_columns: Allows columns to be split over two regions.
+                  Examples:
+                      >>> pixel_map = array([[ 1,  2,  3],

Member

DaniBodor Nov 8, 2023

Suggested change

      
                    >>> pixel_map = array([[ 1,  2,  3],
          
                    >>> pixel_map = np.array([[ 1,  2,  3],

eitprocessing/roi_selection/gridselection.py

Comment on lines +316 to +325

+              class InvalidDivision(Exception):
+                  """Raised when the data can't be divided into regions."""
+              class InvalidHorizontalDivision(InvalidDivision):
+                  """Raised when the data can't be divided into horizontal regions."""
+              class InvalidVerticalDivision(InvalidDivision):
+                  """Raised when the data can't be divided into vertical regions."""

Member

DaniBodor Nov 8, 2023

what's the added value of these over a ValueError?

and what's the added value of having separate horizontal and vertical versions, over specifying this in the error message (same question for warnings below)?

eitprocessing/roi_selection/gridselection.py

Comment on lines +95 to +98

+                  split_rows: bool = False
+                  split_columns: bool = False
+                  ignore_nan_rows: bool = True
+                  ignore_nan_columns: bool = True

Member

DaniBodor Nov 22, 2023

I think it would be nice to have a single argument for each of split and ignore, as I think that more often than not a user would want the same behavior for both.
We can set it so that it accepts either a bool or a 2-part tuple of bools (for horizontal and vertical).

Suggested change

      
                split_rows: bool = False
          
                split_columns: bool = False
          
                ignore_nan_rows: bool = True
          
                ignore_nan_columns: bool = True
          
                split_pixels: bool | tuple[bool, bool] = False
          
                ignore_nan_edges: bool | tuple[bool, bool] = True

Also, what would happen and what do we want to happen if the center row/column of an array is nan?

eitprocessing/roi_selection/gridselection.py

Comment on lines +309 to +313

+                  def matrix_layout(self) -> NDArray:
+                      """Returns a 2D array showing the layout of the matrices returned by
+                      `find_grid`."""
+                      n_regions = self.v_split * self.h_split
+                      return np.reshape(np.arange(n_regions), (self.v_split, self.h_split))

Member

DaniBodor Nov 22, 2023

I think it'd be nice to make this a property rather than a function. See 33905ee for suggestion

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels