Option to store nuisance parameter objects #1473

MCKnaus · 2024-11-20T09:55:40Z

This is no issue, but a suggestion for a feature.

Currently the objects include the estimated nuisance parameters Y.hat, W.hat ... However, the underlying regression_forest objects are gone for good (at least as far as I can see).

For a new paper that extracts the outcome weights of the point estimates (https://arxiv.org/abs/2411.11559), I need to know the get_forest_weights of Y.hat. My workaround is currently to estimate Y.hat externally such that I have its object (see this notebook for an illustration why and how).

I perfectly understand that saving nuisance parameter (NP) objects is not attractive to save memory. However, a store_nuisance_parameters option would be quite useful for my purposes. In the best case with control about which of the mutiple NP objects should be saved.

Just FYI that there would be a consumer of such a feature.

Thank you in any case for your great work!

The text was updated successfully, but these errors were encountered:

erikcs · 2024-11-24T08:46:03Z

Hi @MCKnaus, thank you very much for the suggestion (and also for the interesting reference). For cases with custom outcome/propensity models, the intended design was to construct those models outside. If you need to carry those objects with you in downstream tasks, then maybe a very simple option could be to just store them in the returned causal forest object? Like my.forest = causal_forest(X,Y,W,Y.hat=...); my.forest$Y.model = Y.model, then whenever you need access to that causal forest's Y.model you'd just access my.forest$Y.model.

MCKnaus · 2024-11-25T10:21:00Z

Thank you for the suggestion. I will add this as option when revising the OutcomeWeights package. Then, users do not need to explicitly store the smoother matrix. Also it would be immediately compatible if you decide to include this feature and use Y.model as label. In the best case, it eventually boils down to running

my.cf = causal_forest(X,Y,W, store_nuisance_parameter ="Y")
omega = get_outcome_weights(my.cf)

where get_outcome_weights() calls my.cf$Y.model internally.

But feel free to ignore.

MCKnaus mentioned this issue Nov 25, 2024

Allow for Y.model in grf objects MCKnaus/OutcomeWeights#1

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Option to store nuisance parameter objects #1473

Option to store nuisance parameter objects #1473

MCKnaus commented Nov 20, 2024

erikcs commented Nov 24, 2024

MCKnaus commented Nov 25, 2024

Option to store nuisance parameter objects #1473

Option to store nuisance parameter objects #1473

Comments

MCKnaus commented Nov 20, 2024

erikcs commented Nov 24, 2024

MCKnaus commented Nov 25, 2024