refactor colsubset exprs tracking so it can be overridden in split funs #795

gmbecker · 2023-11-29T19:35:20Z

This closes #785, allowing custom split functions, specifically functions which override the core function, to specify subsetting expressions for the facets they generate, which is necessary for the custom split to function properly in column space.

This is backwards compatible, with two exceptions: using a custom splitting function that overrides core splitting behavior in column space was previously an error, and any TableTree objects that have been serialized to rds/rda files will need to be refreshed, as one of the underlying classes has changed. Tests have been added/updated for the new behavior.

shajoezhu · 2023-12-01T05:12:35Z

Hi @gmbecker, thanks a lot for the changes. We will need to create PRs and trigger the CI/CD pipelines to check whether this introduces any breaking changes downstream. I will get back to this once I know more.

Melkiades · 2023-12-22T11:18:34Z

R/make_split_fun.R

 #'
 #' @examples
 #' splres <- make_split_result(
 #'   values = c("hi", "lo"),
 #'   datasplit = list(hi = mtcars, lo = mtcars[1:10, ]),
-#'   labels = c("more data", "less data")
+#'   labels = c("more data", "less data"),
+#'   subset_exprs = list(expression(TRUE), expression(seq_along(wt) <= 10))


So this would apply the subset only in column space, while it is ignored in row space, right?

gmbecker · 2024-01-23

Correct. Faceting in column space is tracked via expressions, while faceting in row space is immediately materialized during tabulation. This is largely for a combination of performance and legacy reasons.
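To make the distinction concrete, here is a minimal base-R sketch (it does not use rtables internals) that contrasts keeping a facet as an unevaluated expression with materializing the subset right away. The variable names are illustrative only; the expression is the one from the `@examples` hunk above.

```r
# Column-space style: the facet is stored as an unevaluated expression.
facet_expr <- expression(seq_along(wt) <= 10)

# Row-space style: the subset is materialized immediately.
facet_data <- mtcars[1:10, ]

# The stored expression is only evaluated against the data when rows are needed.
keep <- eval(facet_expr[[1]], envir = mtcars)
lazy_data <- mtcars[keep, ]

# Both routes select the same rows.
all(rownames(lazy_data) == rownames(facet_data))
```

Storing the expression rather than the data means the same facet definition can be re-applied to a different data frame later, which is one plausible reading of the performance motivation mentioned above.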

Melkiades · 2023-12-22T11:20:28Z

R/make_split_fun.R

  validate_split_result(splres)
-  newstuff <- make_split_result(values, datasplit, labels, extras)
+  newstuff <- make_split_result(values, datasplit, labels, extras, subset_exprs = list(sub_expr))


Why doesn't sub_expr default to the splres values if sub_expr is NULL?

gmbecker · 2024-01-23

So there are two cases that can happen here:

1. values are strings (this is more common, as this is being called in custom code), in which case value_expr(values[.]) will always be NULL, so that wouldn't change anything.

2. values are already SplitValue objects, in which case they (should) already have their expressions set (if they are not the default, which will still work in most cases), so they don't need to be changed.

Currently, as written, subset_expr ends up being ignored when values are already SplitValue objects.

This is defensible behavior, I think (as the construction time of the SplitValue is where the custom expression "should" be set), but probably not optimal. I'll change things slightly to make it easier to override subset expressions if (for some reason) you're reusing the same value object but want different subsetting behavior (which would be weird, but I can think of a corner case where you might want to). As originally written, you'd have been expected to reset the expressions on the value objects before passing them to make_split_result.
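The precedence described above could be sketched roughly as follows. This is a toy illustration under stated assumptions: `new_value()` and `resolve_sub_expr()` are hypothetical helpers built on plain lists, not the rtables SplitValue class or any function in this PR, and the exact rule the final code uses may differ.

```r
# Toy stand-in for a split value: a plain list, NOT the rtables SplitValue class.
new_value <- function(name, sub_expr = NULL) {
  list(name = name, sub_expr = sub_expr)
}

# Hypothetical resolution rule: an explicit override wins, then the expression
# already stored on the value, then a default supplied by the caller.
resolve_sub_expr <- function(value, override = NULL,
                             default = expression(TRUE)) {
  if (!is.null(override)) {
    return(override)
  }
  if (!is.null(value$sub_expr)) {
    return(value$sub_expr)
  }
  default
}

# Case 1: a value built from a bare string carries no expression.
from_string <- resolve_sub_expr(new_value("hi"))

# Case 2: a value object that already has its expression set.
val <- new_value("lo", expression(seq_along(wt) <= 10))
from_object <- resolve_sub_expr(val)

# Reusing the same value object with different subsetting behavior.
overridden <- resolve_sub_expr(val, override = expression(cyl == 4))
```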

Melkiades · 2023-12-22T11:28:42Z

R/make_split_fun.R

  }
 }

+
+.or_combine_exprs <- function(ex1, ex2) {


I remember us talking about making expression handling more general and contained in a single system of functions. Do you think it is worth it?

I mean adding this to make_subset_expr (or is this too specific?)
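For readers following along: the body of `.or_combine_exprs` is not shown in this hunk. Going only by its name, one way to OR-combine two subsetting expressions in base R might look like the sketch below. `or_combine_sketch()` is a hypothetical name, and the real helper may well handle special cases (such as `expression(TRUE)`) differently.

```r
# Hypothetical sketch: build a new expression that is TRUE wherever either
# input expression is TRUE. Not the actual body of .or_combine_exprs.
or_combine_sketch <- function(ex1, ex2) {
  stopifnot(is.expression(ex1), is.expression(ex2))
  a <- ex1[[1]]
  b <- ex2[[1]]
  # Parenthesize both sides so operator precedence cannot change the meaning.
  as.expression(bquote((.(a)) | (.(b))))
}

combined <- or_combine_sketch(expression(cyl == 4), expression(cyl == 8))

# Evaluate against a data frame to get a logical row selector.
selected <- eval(combined[[1]], envir = mtcars)
sum(selected)
```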

Melkiades · 2023-12-22T11:34:44Z

R/00tabletrees.R

    sub = .combine_subset_exprs(
-      pos_subset(parpos),
-      make_subset_expr(newspl, nsplitval)
+        pos_subset(parpos),
+        ## this will grab the value's custom subset expression if present
+        make_subset_expr(newspl, nsplitval)


This is the core effect of altering column splits.
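As a rough illustration of what combining a parent position's subset with a new split value's subset amounts to, here is a base-R sketch that ANDs two expressions. `and_combine_sketch()` is a hypothetical stand-in; the internals of `.combine_subset_exprs`, `pos_subset` and `make_subset_expr` are not shown in this diff and are not reproduced here.

```r
# Hypothetical sketch: a nested column facet selects rows that satisfy both
# the parent facet's expression and the child value's expression.
and_combine_sketch <- function(parent_ex, child_ex) {
  stopifnot(is.expression(parent_ex), is.expression(child_ex))
  p <- parent_ex[[1]]
  ch <- child_ex[[1]]
  as.expression(bquote((.(p)) & (.(ch))))
}

# Parent facet: 4-cylinder cars. Child facet: manual transmission.
nested <- and_combine_sketch(expression(cyl == 4), expression(am == 1))

# Rows of mtcars that fall into the nested facet.
in_facet <- eval(nested[[1]], envir = mtcars)
sum(in_facet)
```

If the child value carries a custom subset expression, that expression would take the place of `child_ex` here, which is why this call site determines how custom column splits behave.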

Melkiades · 2023-12-22T11:36:04Z

tests/testthat/test_utils.R

@@ -1,6 +1,7 @@
 context("Checking utility functions")

 test_that("func_takes works with different inputs", {
+    func_takes <- rtables:::func_takes


This is not needed, as tests have access to the whole namespace (including non-exported functions).

Suggested change

func_takes <- rtables:::func_takes

Melkiades

I think this is very good! Thanks Gabe, I will unlock the draft to see if tests sail smoothly ;)

gmbecker requested review from edelarua, Melkiades and ayogasekaram as code owners November 29, 2023 19:35

shajoezhu removed request for Melkiades, edelarua and ayogasekaram November 30, 2023 15:53

shajoezhu marked this pull request as draft November 30, 2023 15:53

shajoezhu self-assigned this Nov 30, 2023

shajoezhu self-requested a review December 1, 2023 05:12

shajoezhu added the sme label Dec 1, 2023

Melkiades reviewed Dec 22, 2023

Melkiades marked this pull request as ready for review December 22, 2023 11:37

gmbecker closed this Jan 23, 2024
