You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This could lead to overfitting on the test set and underestimating the conditional importance of covariate $j$.
Another solution suggested by Ahmad would be to cross-fit on the test set. This could on the contrary lead to poorer fit of the covariate estimator compared to methods like LOCO that leverage the entire train set for accounting for correlations.
Also, I find the naming importance_estimator a bit misleading. It reminds me of the method used for importance estimation (LOCO, SHAP, CPI...) but actually corresponds to the estimator used in CPI to predict $X^j$ from $X^{-j}$
The text was updated successfully, but these errors were encountered:
The covariate estimator$\hat{\mathcal{E}}[X^j|X^{-j}]$ is currently fitted and used for predicting on the test set:
hidimstat/hidimstat/compute_importance.py
Lines 251 to 265 in 7c3af80
This could lead to overfitting on the test set and underestimating the conditional importance of covariate$j$ .
Another solution suggested by Ahmad would be to cross-fit on the test set. This could on the contrary lead to poorer fit of the covariate estimator compared to methods like LOCO that leverage the entire train set for accounting for correlations.
Also, I find the naming$X^j$ from $X^{-j}$
importance_estimator
a bit misleading. It reminds me of the method used for importance estimation (LOCO, SHAP, CPI...) but actually corresponds to the estimator used in CPI to predictThe text was updated successfully, but these errors were encountered: