Implement adaptive localization #4243

dafeda · 2022-11-10T12:44:02Z

Resolves: #4411

Note that the current implementation is naive in that it loops through all parameters.
My understanding is that this is the most accurate but too computationally expensive in practice.
Perhaps we can use this as reference to compare more efficient methods against.

I've added two new keywords; LOCALIZATION and LOCALIZATION_CORRELATION_THRESHOLD which may be set as for example:

ANALYSIS_SET_VAR STD_ENKF LOCALIZATION True
ANALYSIS_SET_VAR STD_ENKF LOCALIZATION_CORRELATION_THRESHOLD 0.2

Pre review checklist

Added appropriate release note label
PR title captures the intent of the changes, and is fitting for release notes.
Commit history is consistent and clean, in line with the contribution guidelines.

Adding labels helps the maintainers when writing release notes. This is the list of release note labels.

frode-aarstad

Just looking at the code it looks good.

Might be worth it to add some comments to the functions if its not obvious whats going on.
Is it possible to write a test for this specific improvement? ie: correlated_parameter_response_pairs

src/ert/analysis/_es_update.py

codecov-commenter · 2023-01-04T05:30:40Z

Codecov Report

Merging #4243 (6bb7092) into main (b828844) will increase coverage by 0.07%.
The diff coverage is 89.15%.

@@            Coverage Diff             @@
##             main    #4243      +/-   ##
==========================================
+ Coverage   59.19%   59.26%   +0.07%     
==========================================
  Files         440      440              
  Lines       30667    30729      +62     
  Branches     3135     3135              
==========================================
+ Hits        18153    18212      +59     
- Misses      11729    11732       +3     
  Partials      785      785

Impacted Files	Coverage Δ
...ert/gui/ertwidgets/analysismodulevariablespanel.py	`19.81% <12.50%> (-0.57%)`	⬇️
src/ert/_c_wrappers/analysis/analysis_module.py	`82.50% <93.10%> (+1.12%)`	⬆️
src/ert/_c_wrappers/analysis/__init__.py	`100.00% <100.00%> (ø)`
src/ert/analysis/_es_update.py	`90.00% <100.00%> (+2.06%)`	⬆️
src/ert/_c_wrappers/enkf/config/gen_data_config.py	`71.42% <0.00%> (-12.50%)`	⬇️
...rc/ert/_c_wrappers/enkf/config/enkf_config_node.py	`90.20% <0.00%> (-0.52%)`	⬇️
src/ert/gui/ertwidgets/validationsupport.py	`98.63% <0.00%> (+19.17%)`	⬆️

📣 We’re building smart automated test selection to slash your CI/CD build times. Learn more

src/ert/analysis/_es_update.py

tommyod · 2023-09-28T07:44:06Z

We can speed up the computations by not forming the diagonal matrices. Here's an example showing what I mean, where Sigma_A and Sigma_Y are never formed as diagonal matrices. Runs around 20x faster than the current code on these sizes.

ensemble_size = 10
S = np.random.randn(1000, ensemble_size)
X_local = np.random.randn(1000, ensemble_size)

# Standard deviations
Sigma_Y = np.std(S, axis=1, ddof=1)
Sigma_A = np.std(X_local, axis=1, ddof=1)

# State-measurement covariance matrix
Y_prime = S - S.mean(axis=1, keepdims=True)
A = X_local - X_local.mean(axis=1, keepdims=True)
C_AY = A @ Y_prime.T / (ensemble_size - 1)

# Absolute values of the correlation matrix
c_AY = np.abs((C_AY / Sigma_Y.reshape(1, -1)) / Sigma_A.reshape(-1, 1))

alternatively, if C_AA and C_YY are needed:

ensemble_size = 10
S = np.random.randn(1000, ensemble_size)
X_local = np.random.randn(1000, ensemble_size)

# Estimate covariance matrix for outputs
Y_prime = S - S.mean(axis=1, keepdims=True)
C_YY = Y_prime @ Y_prime.T / (ensemble_size - 1)
Sigma_Y = np.sqrt(np.diag(C_YY))

A = X_local - X_local.mean(axis=1, keepdims=True)
C_AA = A @ A.T / (ensemble_size - 1)
Sigma_A = np.sqrt(np.diag(C_AA))

# State-measurement covariance matrix
C_AY = A @ Y_prime.T / (ensemble_size - 1)

# State-measurement correlation matrix
c_AY = np.abs((C_AY / Sigma_Y.reshape(1, -1)) / Sigma_A.reshape(-1, 1))

Blunde1 · 2023-10-06T12:04:20Z

We can speed up the computations by not forming the diagonal matrices. Here's an example showing what I mean, where Sigma_A and Sigma_Y are never formed as diagonal matrices. Runs around 20x faster than the current code on these sizes.

I don't see there being anything in the fit() or update() that is slower than the previous dense matrix inversion of Sigma_Y (only needs to be formed once, outside loops) and Sigma_A, so I fully support this. It would then even be possible to hope that adaptive localization runs in reasonable time even on Troll. It is at least worth to test it.

Add option of running adaptive localization that can simply be turned on and does not need any user input. Only parameters that are significantly correlated to responses will be updated. Default value of what constitutes significant correlation is calculated based on theory, but can be set by the user.

dafeda · 2023-10-18T12:13:00Z

Has been merged to main.

dafeda added the release-notes:new-feature Automatically categorise as new feature in release notes label Nov 10, 2022

dafeda self-assigned this Nov 10, 2022

dafeda requested review from oyvindeide, pinkwah, ManInFez and frode-aarstad November 10, 2022 12:48

dafeda force-pushed the localization branch 3 times, most recently from c4d6da6 to fe5a485 Compare November 11, 2022 13:14

frode-aarstad reviewed Nov 14, 2022

View reviewed changes

src/ert/analysis/_es_update.py Outdated Show resolved Hide resolved

dafeda force-pushed the localization branch 6 times, most recently from 62bd91a to 6af26ac Compare November 16, 2022 13:45

dafeda force-pushed the localization branch 4 times, most recently from 624ce42 to 9ff133e Compare November 28, 2022 09:53

dafeda force-pushed the localization branch from 9ff133e to 91bb1da Compare December 1, 2022 08:51

dafeda force-pushed the localization branch 2 times, most recently from 04b5241 to a946297 Compare December 13, 2022 09:37

dafeda force-pushed the localization branch from a946297 to af689f2 Compare December 21, 2022 12:36

dafeda force-pushed the localization branch 3 times, most recently from 6da3aa5 to 3724fd2 Compare January 4, 2023 05:15

dafeda force-pushed the localization branch from 3724fd2 to 6aca933 Compare January 6, 2023 07:10

geirev reviewed Jan 10, 2023

View reviewed changes

src/ert/analysis/_es_update.py Outdated Show resolved Hide resolved

dafeda force-pushed the localization branch 2 times, most recently from 87e8eaa to ef0252c Compare April 4, 2023 11:54

dafeda force-pushed the localization branch from ef0252c to 9789445 Compare April 12, 2023 12:05

dafeda removed the request for review from ManInFez April 12, 2023 13:02

dafeda force-pushed the localization branch 2 times, most recently from b470cd7 to a4eb36e Compare April 17, 2023 11:50

dafeda force-pushed the localization branch 2 times, most recently from 3f5e943 to f164944 Compare April 27, 2023 13:30

dafeda force-pushed the localization branch 2 times, most recently from 74e63f6 to 1aeeb5e Compare June 5, 2023 17:46

dafeda mentioned this pull request Jun 7, 2023

Use Bonferroni (or similar) correction to the correlation threshold #5557

Open

dafeda force-pushed the localization branch 2 times, most recently from 14baf1a to 0007e8e Compare June 20, 2023 06:43

dafeda force-pushed the localization branch from 0007e8e to afb5a33 Compare July 31, 2023 10:22

dafeda force-pushed the localization branch from afb5a33 to 4dd776a Compare August 8, 2023 10:14

dafeda force-pushed the localization branch 3 times, most recently from b6f14fe to 20cfcde Compare September 22, 2023 11:07

dafeda marked this pull request as ready for review October 10, 2023 06:43

Blunde1 mentioned this pull request Oct 16, 2023

Compute cross-correlation matrices without matrix inversion #6330

Closed

Temporarily add flow_config.yml to enable running with flow

4180da7

Blunde1 mentioned this pull request Oct 16, 2023

Compute cross-correlation matrices without matrix inversion #6339

Merged

dafeda force-pushed the localization branch from 20cfcde to d80f169 Compare October 17, 2023 06:20

dafeda closed this Oct 18, 2023

dafeda deleted the localization branch October 18, 2023 12:13

sondreso mentioned this pull request Mar 20, 2024

Localisation #7492

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement adaptive localization #4243

Implement adaptive localization #4243

dafeda commented Nov 10, 2022 •

edited

Loading

frode-aarstad left a comment •

edited

Loading

codecov-commenter commented Jan 4, 2023 •

edited

Loading

tommyod commented Sep 28, 2023

Blunde1 commented Oct 6, 2023

dafeda commented Oct 18, 2023

Implement adaptive localization #4243

Implement adaptive localization #4243

Conversation

dafeda commented Nov 10, 2022 • edited Loading

Pre review checklist

frode-aarstad left a comment • edited Loading

Choose a reason for hiding this comment

codecov-commenter commented Jan 4, 2023 • edited Loading

Codecov Report

tommyod commented Sep 28, 2023

Blunde1 commented Oct 6, 2023

dafeda commented Oct 18, 2023

dafeda commented Nov 10, 2022 •

edited

Loading

frode-aarstad left a comment •

edited

Loading

codecov-commenter commented Jan 4, 2023 •

edited

Loading