Improve `subsample` across `Coreg` subclasses and pipelines #436

rhugonnet · 2023-09-06T23:21:47Z

This PR makes the `subsample` argument consistent across all `Coreg` subclasses and lets each step of a pipeline apply its own subsampling independently. It also reworks `Coreg.fit()` to be more robust for future chunking routines.

All details are in the full discussion here: #428
The only difference from the discussion: `inlier_mask` is now passed directly to each subclass' `_fit_func`, so that the subsampling mask can be computed on the final mask of valid pixels within the subclass. (The valid data changes in the subclass because of NaNs in auxiliary data not yet available to `Coreg.fit()`: slope, curvature, etc.) This way we always get the exact number of valid points requested when subsampling 😄!
To perform the subsampling consistently on the final valid mask, each `Coreg` subclass calls the method `get_subsample_on_valid_mask`.
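
A minimal sketch of the idea, not xdem's actual implementation: the helper name `subsample_on_valid_mask`, its signature, and the convention that a value of 1 or less is a fraction while a larger value is a point count are all assumptions made for illustration. It shows why drawing the subsample after combining the inlier mask with the auxiliary-data NaNs yields exactly the requested number of valid points.

```python
import numpy as np


def subsample_on_valid_mask(inlier_mask, aux_arrays, subsample, rng=None):
    """Return a boolean mask selecting a random subsample of the valid pixels.

    Hypothetical helper for illustration only.
    """
    rng = np.random.default_rng(rng)

    # Final valid mask: inliers that are also finite in every auxiliary array
    valid = inlier_mask.copy()
    for arr in aux_arrays:
        valid &= np.isfinite(arr)

    n_valid = int(np.count_nonzero(valid))

    # Assumed convention: <= 1 is a fraction of the valid pixels, > 1 is a count
    if subsample <= 1:
        n_sub = int(round(subsample * n_valid))
    else:
        n_sub = min(int(subsample), n_valid)

    # Draw flat indices among the valid pixels only, without replacement
    valid_idx = np.flatnonzero(valid)
    chosen = rng.choice(valid_idx, size=n_sub, replace=False)

    sub_mask = np.zeros(valid.shape, dtype=bool)
    sub_mask[np.unravel_index(chosen, valid.shape)] = True
    return sub_mask


# Auxiliary data (e.g. slope) that is NaN on the first 10 rows
slope = np.random.default_rng(42).normal(size=(100, 100))
slope[:10, :] = np.nan
inlier_mask = np.ones((100, 100), dtype=bool)

mask = subsample_on_valid_mask(inlier_mask, [slope], subsample=500, rng=42)
print(mask.sum())
```

Had the 500 points been drawn before the slope NaNs were known, some of them would likely have fallen on the invalid rows and been discarded later.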

Tests significantly improved.

Since each class now has a default `subsample` argument at instantiation, methods that currently rely on optimization without binning (NuthKaab, Deramping) default to 5e5 to avoid very long computing times; all others default to 1.
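
A toy sketch of per-step subsampling in a pipeline. `ToyStep` and `ToyPipeline` are hypothetical stand-ins, not xdem's API, and the fraction-versus-count reading of `subsample` is an assumption. The count is scaled down from 5e5 to 5000 to keep the example quick.

```python
import numpy as np


class ToyStep:
    """Hypothetical stand-in for a Coreg subclass; not xdem's API."""

    def __init__(self, subsample=1):
        # Each step stores its own value, set at instantiation
        self.subsample = subsample
        self.n_used = 0
        self.shift = 0.0

    def fit(self, dh, rng):
        # Valid points are determined inside the step itself
        valid = np.flatnonzero(np.isfinite(dh))
        if self.subsample <= 1:
            n = int(round(self.subsample * valid.size))
        else:
            n = min(int(self.subsample), valid.size)
        picked = rng.choice(valid, size=n, replace=False)
        self.n_used = n
        self.shift = float(np.median(dh[picked]))
        return self


class ToyPipeline:
    """Hypothetical pipeline: fits each step in turn with that step's subsample."""

    def __init__(self, steps):
        self.steps = steps

    def fit(self, dh, seed=0):
        rng = np.random.default_rng(seed)
        for step in self.steps:
            step.fit(dh, rng)
            # Apply the step's correction before fitting the next one
            dh = dh - step.shift
        return self


# Synthetic 1D elevation differences with a shift of 5 and some NaNs
dh = 5.0 + np.random.default_rng(0).normal(size=20_000)
dh[:100] = np.nan

# First step uses a fixed count, second step keeps the default of 1 (all valid points)
pipeline = ToyPipeline([ToyStep(subsample=5000), ToyStep()])
pipeline.fit(dh)
print([step.n_used for step in pipeline.steps])
```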

Also a small fix to geoutils.raster.subsampling was necessary: GlacioHack/geoutils#402.

Resolves #428
Resolves #243
Resolves #137

erikmannerfelt

I love this and I love you

Just minor comments!

erikmannerfelt · 2023-09-25T09:22:16Z

dev-environment.yml

@@ -52,4 +52,4 @@ dependencies:
    - noisyopt

    # To run CI against latest GeoUtils
-#     - git+https://github.com/GlacioHack/GeoUtils.git
+    - git+https://github.com/GlacioHack/geoutils.git


Wasn't there a reason we stopped pulling from GitHub? I think if we make a breaking change in geoutils, xdem will fail on new PRs without it being the "PR's fault". Any way around this? Otherwise, please make an issue for it so we can revert this asap!

Yes, this should only be used temporarily to test that CI passes on the main branch for devs; we don't want to make a release with it! Should we add a reminder in HOW_TO_RELEASE?

Now that GeoUtils 0.0.15 is published, I can revert it back!
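
One possible form for such a reminder is a small pre-release check. This is only a sketch: `find_git_dependencies` is a hypothetical helper, not part of xdem, and it assumes dependencies appear one per line as in the hunk above.

```python
def find_git_dependencies(env_text):
    """Return uncommented dependency lines that install straight from a Git URL."""
    hits = []
    for line in env_text.splitlines():
        stripped = line.strip()
        # Commented-out entries are harmless, so skip them
        if stripped.startswith("#"):
            continue
        if "git+" in stripped:
            # Drop the leading YAML list marker
            hits.append(stripped.lstrip("- ").strip())
    return hits


example = """
dependencies:
    - noisyopt
    # - git+https://github.com/GlacioHack/geoutils.git
    - git+https://github.com/GlacioHack/geoutils.git
"""

leftovers = find_git_dependencies(example)
if leftovers:
    print("Git dependencies still active:", leftovers)
```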

erikmannerfelt · 2023-09-25T09:27:47Z

tests/test_coreg/test_base.py

+            # Check that the estimated biases are similar
+            assert coreg_sub._meta["coefficients"] == pytest.approx(coreg_full._meta["coefficients"], rel=1e-1)
+
+    def test_subsample__pipeline(self) -> None:


The double underscore is a typo, right?

No haha it's for subdividing tests more clearly! Many packages do it, and I liked it a lot. I thought it could improve the clarity/organization of our tests that were getting a bit messy.
I think we talked about this in the biascorr PR a bit already! Maybe we should establish guidelines for our tests?
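
A small sketch of the `test_<feature>__<case>` naming pattern, using a toy estimator rather than xdem's classes. `estimate_shift` is hypothetical, and `math.isclose` stands in for `pytest.approx` so the block runs without pytest installed (pytest would still collect these methods).

```python
import math

import numpy as np


def estimate_shift(dh, subsample=1, seed=0):
    """Toy estimator (hypothetical): median of a random subsample of dh."""
    rng = np.random.default_rng(seed)
    # Assumed convention: <= 1 is a fraction, > 1 is a point count
    n = int(round(subsample * dh.size)) if subsample <= 1 else min(int(subsample), dh.size)
    return float(np.median(rng.choice(dh, size=n, replace=False)))


class TestSubsample:
    # Synthetic elevation differences with a known shift of 5
    dh = 5.0 + np.random.default_rng(42).normal(size=50_000)

    # Feature before the double underscore, specific case after it
    def test_subsample__fraction(self) -> None:
        full = estimate_shift(self.dh, subsample=1)
        sub = estimate_shift(self.dh, subsample=0.1)
        assert math.isclose(sub, full, rel_tol=1e-1)

    def test_subsample__count(self) -> None:
        full = estimate_shift(self.dh, subsample=1)
        sub = estimate_shift(self.dh, subsample=1000)
        assert math.isclose(sub, full, rel_tol=1e-1)
```

With this naming, `pytest -k "subsample__"` selects every case of the subsample feature at once.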

rhugonnet · 2023-09-25T22:58:47Z

> I love this and I love you
>
> Just minor comments!

Happy you like it!! Thanks a lot for the quick review 😊
Next comes the restructuring to have a single `fit()` function for all inputs, the ability to pass any optimizer/binning to Affine classes as well, and some of the rest mentioned in #435.

rhugonnet · 2023-09-26T02:32:32Z

All accounted for, merging and focusing on the next steps! 😄

Add subsample to subclass init

2379773

rhugonnet marked this pull request as draft September 6, 2023 23:21

rhugonnet added 6 commits September 21, 2023 17:52

Merge remote-tracking branch 'upstream/main' into subsample_pipeline

4603724

Progress on subsampling in Coreg classes and pipelines

f7df6cf

Finalize subsampling functionality

e6f9b34

Add tests and minor fixes

a1e16ff

Add test for get_subsample

d28a4fb

Linting

87ebff0

rhugonnet requested review from adehecq and erikmannerfelt September 23, 2023 00:30

rhugonnet added 2 commits September 22, 2023 18:23

Fix other tests

346ead9

Get geoutils from git to check everything passes

6d03b40

rhugonnet marked this pull request as ready for review September 23, 2023 05:18

rhugonnet added 3 commits September 22, 2023 21:19

Linting

b471cfd

Add exception in generate pip from conda script when pulling from repo directly

348e4e3

Change geoutils version required in advance

722e960

erikmannerfelt reviewed Sep 25, 2023

First commit for eriks comments

0abc6e8

rhugonnet mentioned this pull request Sep 25, 2023

Re-organization of coreg.py #327

Closed

Finalize eriks comments

4839c09

Linting

76457e1

rhugonnet merged commit ead2f1b into GlacioHack:main Sep 26, 2023
11 checks passed

rhugonnet deleted the subsample_pipeline branch September 26, 2023 02:32

rhugonnet mentioned this pull request Nov 13, 2023

Coregistration: Tilt and VerticalShift pipelines in 0.0.16 #447

Closed

rhugonnet mentioned this pull request Nov 28, 2023

Allow custom subsampling for each CoregPipeline step separately. #137

Closed

rhugonnet mentioned this pull request Mar 7, 2024

Fuse Coreg point functions and add consistent Raster-Point logic #480

Merged

3 tasks
