Derive auxiliary for selective editing #139

robertswh · 2024-12-18T18:30:38Z

Pull Request Title

Derive auxiliary variable for the selective editing output.

The functionality has been written and added to the output but hasn't been tested yet.

Summary

Derive the auxiliary variables required for Q49 in the selective editing output for SPP.

Type of Change

Checklists

This pull request meets the following requirements:

Creator Checklist

Installable with all dependencies recorded
Runs without error
Follows PEP8 and project-specific conventions
Appropriate use of comments, for example, no descriptive comments
Functions documented using Numpy style docstrings
Assumptions and decisions log considered and updated if appropriate
Unit tests have been updated to cover essential functionality for a reasonable range of inputs and conditions
Other forms of testing such as end-to-end and user-interface testing have been considered and updated as required

If you feel some of these conditions do not apply for this pull request, please
add a comment to explain why.

Reviewer Checklist

Test suite passes (locally as a minimum)
Peer reviewed with review recorded

Additional Information

Please provide any additional information or context that would help the reviewer understand the changes in this pull request.

Related Issues

Link any related issues or pull requests here.

robertswh · 2024-12-18T18:33:30Z

mbs_results/outputs/selective_editing.py

+        dataframe[
+            (dataframe[period] == previous_period) & (dataframe[question_no] == 49)
+        ][[imputation_class, construction_link, question_no]]
+        .drop_duplicates()


Maybe we should check for duplicates across imputation_class and question_no as a quick sanity check?

Not sure if we have a generic validation, but something similar is done when producing SE outputs
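A minimal sketch of the quick check suggested above, assuming the selection has already been reduced to the three columns in the diff. The helper name `find_conflicting_links` and the sample data are hypothetical and not part of the PR; the idea is that after `drop_duplicates()` any key that still appears more than once has conflicting construction links.

```python
import pandas as pd


def find_conflicting_links(links, imputation_class, question_no):
    """Return rows whose (imputation_class, question_no) key has more than one link.

    Hypothetical helper for illustration only.
    """
    # Exact repeats are harmless, so drop them first.
    deduplicated = links.drop_duplicates()
    # Any key still repeated now maps to different construction links.
    is_conflict = deduplicated.duplicated(
        subset=[imputation_class, question_no], keep=False
    )
    return deduplicated[is_conflict]


# Made-up data: class "B" has two different links for question 49.
links = pd.DataFrame(
    {
        "imputation_class": ["A", "A", "B", "B"],
        "construction_link": [1.0, 1.0, 2.0, 2.5],
        "question_no": [49, 49, 49, 49],
    }
)

conflicts = find_conflicting_links(links, "imputation_class", "question_no")
```

An empty result would mean there is exactly one construction link per key; a non-empty one could be raised as a validation error.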

mbs_results/outputs/selective_editing.py

mbs_results/outputs/selective_editing_contributer_output.py

robertswh · 2024-12-22T13:35:27Z

mbs_results/outputs/selective_editing_question_output.py


-    standardising_factor["imputation_flags_adjustedresponse"] = standardising_factor[
+    question_output = pd.merge(standardising_factor, auxiliary_value, on=["reference", "imputation_class", "questioncode"], how="left")


It doesn't make sense to merge on both reference and imputation_class; it should be one or the other.
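As a rough illustration of the point, the sketch below merges on a single level of key. It assumes, purely for the example, that the auxiliary values are held once per imputation_class and questioncode; the thread does not say which level the real table uses, and all values are made up. If the values are held per reference instead, the same pattern applies with reference in place of imputation_class.

```python
import pandas as pd

# Made-up reference-level rows, each belonging to an imputation class.
standardising_factor = pd.DataFrame(
    {
        "reference": [1, 2, 3],
        "imputation_class": ["A", "A", "B"],
        "questioncode": [49, 49, 49],
    }
)

# Assumption for this sketch: one auxiliary value per class and question.
auxiliary_value = pd.DataFrame(
    {
        "imputation_class": ["A", "B"],
        "questioncode": [49, 49],
        "auxiliary_value": [100.0, 250.0],
    }
)

# validate="many_to_one" makes pandas raise if the right-hand keys are not
# unique, so an unexpected duplicate cannot silently multiply rows.
question_output = pd.merge(
    standardising_factor,
    auxiliary_value,
    on=["imputation_class", "questioncode"],
    how="left",
    validate="many_to_one",
)
```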

Jday7879

Code looks good to me, nice work :) I've made a couple of small comments and one nitpick about using .loc to remove the double indexing. I'm not sure which method is actually better, though.

Jday7879 · 2025-01-14T16:38:21Z

mbs_results/outputs/selective_editing.py

@@ -28,7 +30,7 @@ def calculate_predicted_value(
          and imputed_value

    """
-
+    # TODO: This has already been combined somewhere updstream


Is this function still needed in the pipeline? From what I can see, it's only called in unit tests.

Jday7879 · 2025-01-14T16:40:04Z

mbs_results/outputs/selective_editing.py

+        name of column in dataframe containing construction link variable
+    imputation_class : str
+        name of column in dataframe containing imputation class, where
+        there is one contruction link per imputation_class and period


Suggested change

there is one contruction link per imputation_class and period

there is one construction link per imputation_class and period

Jday7879 · 2025-01-14T16:48:05Z

mbs_results/outputs/selective_editing.py

+    q40["auxiliary_value"] = q40[frozen_turnover]
+
+    previous_period = period_selected - pd.DateOffset(months=1)
+    prev_const_link = (


Nitpick: this could maybe be refactored to use .loc; I'm not sure which one is better in the long term.

dataframe.loc[
    (dataframe[period] == previous_period) & (dataframe[question_no] == 49),
    [imputation_class, construction_link, question_no],
]
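To compare the two styles side by side, here is a small self-contained sketch. The column names and data are invented for the example; only the `pd.DateOffset(months=1)` step and the shape of the selection come from the diff.

```python
import pandas as pd

# Made-up data standing in for the pipeline dataframe.
dataframe = pd.DataFrame(
    {
        "period": pd.to_datetime(
            ["2024-01-01", "2024-01-01", "2024-02-01", "2024-01-01"]
        ),
        "question_no": [49, 40, 49, 49],
        "imputation_class": ["A", "A", "A", "B"],
        "construction_link": [1.1, 1.0, 1.2, 0.9],
    }
)

period_selected = pd.Timestamp("2024-02-01")
previous_period = period_selected - pd.DateOffset(months=1)

mask = (dataframe["period"] == previous_period) & (dataframe["question_no"] == 49)
columns = ["imputation_class", "construction_link", "question_no"]

# Chained indexing: filter rows, then select columns in a second step.
chained = dataframe[mask][columns]

# Single .loc call: rows and columns selected together.
with_loc = dataframe.loc[mask, columns]
```

For a read-only selection like this, both forms return the same rows. The `.loc` form does it in one indexing step, which tends to matter mainly if the result is assigned to later, since chained indexing is where pandas' copy-versus-view warnings usually come from.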

Jday7879 · 2025-01-14T16:48:55Z

mbs_results/outputs/selective_editing.py

+        dataframe[
+            (dataframe[period] == previous_period) & (dataframe[question_no] == 49)
+        ][[imputation_class, construction_link, question_no]]
+        .drop_duplicates()


Not sure if we have a generic validation, but something similar is done when producing SE outputs

Jday7879

Changing PR from approved to request changes to avoid it being merged early, following today's stand-up.

I am continuing this work. Removing this review so that others can approve without me changing the review

Jday7879

Looks good to me, merging :)

robertswh added 2 commits December 18, 2024 13:56

added previous construction links for Q49, todo add aux var

59b6355

add function to calculate auxiliary variable

02305db

robertswh commented Dec 18, 2024

robertswh requested a review from lhubbardONS December 18, 2024 18:33

robertswh commented Dec 19, 2024

mbs_results/outputs/selective_editing.py (resolved)

robertswh commented Dec 19, 2024

mbs_results/outputs/selective_editing_contributer_output.py (outdated, resolved)

add auxiliary value function and merge on output

209dae6

robertswh changed the title ~~WIP: 712 derive auxiliary for se~~ Derive auxiliary for selective editing Dec 21, 2024

robertswh commented Dec 22, 2024

robertswh added 5 commits December 22, 2024 13:45

small corrections

3d51249

small fix

c6fd4e7

formatting after pre-commits

df38e43

pass tests

95fe0d5

pass pre-commits

58a6edb

robertswh marked this pull request as ready for review January 14, 2025 12:15

robertswh requested review from Jday7879 and giuliag92 as code owners January 14, 2025 12:15

Jday7879 approved these changes Jan 14, 2025

Jday7879 previously requested changes Jan 16, 2025

sarahcollyer and others added 4 commits January 23, 2025 14:52

Amend q49 to use construction link from current period

95d5769

Fix typos and incorrect values in unit test output data

19479fd

Updated:
- type issue for unit test
- Updated calculated value for reference 4 q49 to 625000

292c69a

pre commit hook changes

2c4e070

Jday7879 approved these changes Jan 24, 2025

Jday7879 merged commit b2a0fb8 into main Jan 24, 2025
8 checks passed
