feat: expressify `lower_bound` and `upper_bound` in `is_between` #1672

MarcoGorelli · 2024-12-28T21:31:30Z

What type of PR is this? (check all applicable

closes #1659

Related issues

Related issue #<issue number>
Closes #<issue number>

Checklist

Code follows style guide (ruff)
Tests added
Documented the changes

If you have comments or can explain your changes, please do so below

MarcoGorelli · 2024-12-28T21:31:59Z

narwhals/_arrow/namespace.py

-    def sum(self: Self, *column_names: str) -> ArrowExpr:
-        return ArrowExpr.from_column_names(
-            *column_names, backend_version=self._backend_version, version=self._version
-        ).sum()
-
-    def mean(self: Self, *column_names: str) -> ArrowExpr:
-        return ArrowExpr.from_column_names(
-            *column_names, backend_version=self._backend_version, version=self._version
-        ).mean()
-
-    def median(self: Self, *column_names: str) -> ArrowExpr:
-        return ArrowExpr.from_column_names(
-            *column_names, backend_version=self._backend_version, version=self._version
-        ).median()
-
-    def max(self: Self, *column_names: str) -> ArrowExpr:
-        return ArrowExpr.from_column_names(
-            *column_names, backend_version=self._backend_version, version=self._version
-        ).max()
-
-    def min(self: Self, *column_names: str) -> ArrowExpr:
-        return ArrowExpr.from_column_names(
-            *column_names, backend_version=self._backend_version, version=self._version
-        ).min()
-


drive-by - it's redundat to define all of these in the CompliantExprs

Nice one! I love to see net negative in PRs 👌

FBruzzesi

Left a minor comment on the unit test.

The approach used here to expressify is_between seems generic enough to be used elsewhere, is that right?! I somehow thought it was going to be fairly more complex, yet extract_compliant together with <>_and_extract_native seem all we need 🙌🏼🚀

FBruzzesi · 2024-12-29T09:22:24Z

narwhals/_arrow/namespace.py

-    def sum(self: Self, *column_names: str) -> ArrowExpr:
-        return ArrowExpr.from_column_names(
-            *column_names, backend_version=self._backend_version, version=self._version
-        ).sum()
-
-    def mean(self: Self, *column_names: str) -> ArrowExpr:
-        return ArrowExpr.from_column_names(
-            *column_names, backend_version=self._backend_version, version=self._version
-        ).mean()
-
-    def median(self: Self, *column_names: str) -> ArrowExpr:
-        return ArrowExpr.from_column_names(
-            *column_names, backend_version=self._backend_version, version=self._version
-        ).median()
-
-    def max(self: Self, *column_names: str) -> ArrowExpr:
-        return ArrowExpr.from_column_names(
-            *column_names, backend_version=self._backend_version, version=self._version
-        ).max()
-
-    def min(self: Self, *column_names: str) -> ArrowExpr:
-        return ArrowExpr.from_column_names(
-            *column_names, backend_version=self._backend_version, version=self._version
-        ).min()
-


Nice one! I love to see net negative in PRs 👌

FBruzzesi · 2024-12-29T09:36:24Z

tests/expr_and_series/is_between_test.py

+def test_is_between_expressified(constructor: Constructor) -> None:
+    data = {"a": [1, 4, 2, 5], "b": [0, 5, 2, 4], "c": [9, 9, 9, 9]}
+    df = nw.from_native(constructor(data))
+    result = df.select(nw.col("a").is_between(nw.col("b"), nw.col("c")))


Should we test with a generic expression instead of just col?
Random proposal that would not change expected_dict:

Suggested change

result = df.select(nw.col("a").is_between(nw.col("b"), nw.col("c")))

result = df.select(nw.col("a").is_between(nw.col("b") * 0.9, nw.col("c") - 1))

FBruzzesi · 2024-12-29T09:55:13Z

On a second thought, we are creating some asymmetry between what we can pass to Series.is_between vs Expr.is_between, aren't we? I would expect to be able to pass series' to Series.is_between, e.g.

some_series.is_between(lower_bound_series, upper_bound_series)

Apologies if this is already possible, if so we may want to add a test case for it

MarcoGorelli · 2024-12-29T14:00:32Z

thanks for your review! yup, this is what we do in __eq__ to accept either scalars or expressions / series, we can almost certainly do it in more places (and probably reuse more code)

feat: expressify lower_bound and upper_bound in is_between

5a97cf1

MarcoGorelli commented Dec 28, 2024

View reviewed changes

MarcoGorelli added 2 commits December 28, 2024 21:34

docs

fb12579

🔥

73e2076

MarcoGorelli added the enhancement New feature or request label Dec 28, 2024

MarcoGorelli marked this pull request as ready for review December 28, 2024 21:42

FBruzzesi reviewed Dec 29, 2024

View reviewed changes

MarcoGorelli added 3 commits December 29, 2024 13:40

Merge remote-tracking branch 'upstream/main' into expressify-is-between

0d340c3

add Series test, modify the expressions a little

c460e28

better hints

40a0151

MarcoGorelli merged commit 2dd4480 into narwhals-dev:main Dec 29, 2024
24 checks passed

FBruzzesi mentioned this pull request Dec 30, 2024

[Enh]: Generalization for Expr/Series arguments #730

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: expressify `lower_bound` and `upper_bound` in `is_between` #1672

feat: expressify `lower_bound` and `upper_bound` in `is_between` #1672

MarcoGorelli commented Dec 28, 2024

MarcoGorelli Dec 28, 2024

FBruzzesi Dec 29, 2024

FBruzzesi left a comment

FBruzzesi Dec 29, 2024

FBruzzesi Dec 29, 2024

FBruzzesi commented Dec 29, 2024

MarcoGorelli commented Dec 29, 2024

	result = df.select(nw.col("a").is_between(nw.col("b"), nw.col("c")))
	result = df.select(nw.col("a").is_between(nw.col("b") * 0.9, nw.col("c") - 1))

feat: expressify lower_bound and upper_bound in is_between #1672

feat: expressify lower_bound and upper_bound in is_between #1672

Conversation

MarcoGorelli commented Dec 28, 2024

What type of PR is this? (check all applicable

Related issues

Checklist

If you have comments or can explain your changes, please do so below

MarcoGorelli Dec 28, 2024

Choose a reason for hiding this comment

FBruzzesi Dec 29, 2024

Choose a reason for hiding this comment

FBruzzesi left a comment

Choose a reason for hiding this comment

FBruzzesi Dec 29, 2024

Choose a reason for hiding this comment

FBruzzesi Dec 29, 2024

Choose a reason for hiding this comment

FBruzzesi commented Dec 29, 2024

MarcoGorelli commented Dec 29, 2024

feat: expressify `lower_bound` and `upper_bound` in `is_between` #1672

feat: expressify `lower_bound` and `upper_bound` in `is_between` #1672