underspecified-iv

This repository is the corresponding implementation to Sequential Underspecified Instrument Selection for Cause-Effect Estimation (EA, Jason Hartford, Niki Kilbertus).

Quick Setup

git clone [email protected]:eailer/underspecified-iv
cd underspecified-iv

The code runs on Python 3.9. We recommend to create a new environment and to install the packages in the spec-file.txt file.

conda create -n underspecified_iv python=3.9
conda install -n underspecified_iv --file spec-file.txt
conda activate underspecified_iv

Reproducing experiments

Figure 1

Figure 1 shows the finite sample properties of a setting with $d_z=3$, therefore three possible instruments and $d_x=3$, resp. $d_x=10$ potential treatment variables. The causal effect $\beta$ is generated as such that-in principle-can be recovered in full.

python src/run_finite_sample_estimation.py --p 3 --seed 253 --n 1000 --n_runs 500

resp.

python src/run_finite_sample_estimation.py --p 10 --seed 253 --n 1000 --n_runs 500

Each command has three outputs which will be stored in a folder (example for $p=3$)/Output/p3_n_runs500:

p3_n_runs500.npy for the scenario i.e. the exact random variables as well as any constants and parameters to recreate the results.
results.npy: result file with trajectory values, final optimization value etc.
One .pdf file containing the plot.

Figure 2 and Figure 3

Figure 2 and Figure 3 show the results of the sequential selection algorithm for the parameter setting $d_z = 30, d_x = 50, d_{id} = 15, N_{IV/exp} = 3$ and $T=6$.

python src/run_optimization.py --d 30 --p 50 --d_id 15 --n_runs 500 --n_rounds 6 --d_max 3 --seed 1911

This command has three outputs which will be stored in a folder Output/p50_d30_d_id15_d_max3_n_rounds6

p50_d30_d_id15_d_max3_n_rounds6.npy i.e. the exact random variables as well as any constants and parameters to recreate the results.
results.npy: result file with trajectory values, final optimization value etc.
Five .pdf files containing the plots.

Figures in the Appendix are reproduced accordingly.

Customize

Apart from the reproduction of the experiments, in which we assume ground truth to be known, it is also possible to extract the sequential selection algorithm and customise relevant functionalities to one specific experimental setup example. The relevant functions are stored in optimization_submodular.py.

There are two classes involved to be addressed.

Experiment: The function class takes in parameters and outputs a run of the experiment. This has to be exchanged for real data.
SetProposal: The function class contains the sequential selection. The selection mechanism (SetProposal()._select()) can be modifed to integrate different similarity metrics as well as (SetProposal()._cost_budget()) to integrate different budget constraints for instrumental variable sets.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
src		src
.gitignore		.gitignore
README.md		README.md
spec-file.txt		spec-file.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

underspecified-iv

Quick Setup

Reproducing experiments

Figure 1

Figure 2 and Figure 3

Customize

About

Releases

Packages

Languages

EAiler/underspecified-iv

Folders and files

Latest commit

History

Repository files navigation

underspecified-iv

Quick Setup

Reproducing experiments

Figure 1

Figure 2 and Figure 3

Customize

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages