[FEA] Generalized Adjustment Criterion #1292

nparent1 · 2024-12-30T04:10:38Z

Relates to feature #402

This PR adds support for identifying generalized (non-backdoor) adjustment sets. Specifically, it adds support for finding a minimal adjustment set if one exists (it is guaranteed to find a set if one does exist). Ongoing work in the pywhy-graphs library to enumerate all m-separating sets in causal graphs will later unlock the ability to enumerate all generalized adjustment sets.

The PR roughly adds the following

A new field to the IdentifiedEstimand class to store general adjustment sets (given how ubiquitous the backdoor criterion is I left the backdoor fields alone and created new fields for these general sets)
Support in the auto_identifier.py file's identify_ate_effect() method to compute a general adjustment set to return in an IdentifiedEstimand object
- Finding the adjustment set is all handled in a new identify_generalized_adjustment_set() method
Refactoring to share various utility functions between the backdoor adjustment sets and the general adjustment sets
Unit tests to test a few cases of the generalized adjustment criterion (specifically cases where the backdoor criterion is not met)

In a subsequent PR I can add support for these general adjustment sets in the causal estimation stage.

References:

Benito van der Zander, Maciej Liśkiewicz, and Johannes Textor. "Constructing Separators and
Adjustment Sets in Ancestral Graphs." In Proceedings of UAI 2014, pages 907–916,
2014.

Signed-off-by: Nicholas Parente <[email protected]>

pyproject.toml

amit-sharma

Thanks for starting this @nparent1
Two requests as I review this PR:

Can you add a notebook showing how the new feature can be used? you can add a notebook under docs/source/example_notebooks/. As examples, you can refer to this notebook on ID algorithm or the optimal adjustment criterion. It will be ideal if the notebook can contrast with the output of the backdoor criterion and show when the generalized adjustment criterion is useful.
Can you add documentation to all the user-facing functions? You can refer to auto-identifier.py or other files. I would suggest adding :param: and :returns: documentation for the functions at the minimum.

Signed-off-by: Nicholas Parente <[email protected]>

nparent1 · 2025-01-03T17:49:52Z

Done! @amit-sharma

I've added an example notebook and documentation.

The example notebook is at: docs/source/example_notebooks/dowhy_generalized_covariate_adjustment_example.ipynb

The only user-facing function I had envisioned at this point was the identify_effect_auto() method in auto_identifier.py (and CausalModel's identify_effect() method, but my changes don't touch its method header). I also added documentation to my two utility functions in the graph.py file though, since I saw some of those methods have detailed documentation.

One other flag - in putting together the notebook I think I found a small bug in the frontdoor criterion codepath, which I've addressed in this PR since it prevented me from running identify_effect() on my example graph (I'll add a comment to the PR pointing out the change so you can verify that it is appropriate)

nparent1 · 2025-01-03T17:53:18Z

dowhy/causal_identifier/auto_identifier.py

@@ -795,7 +847,10 @@ def identify_frontdoor(
        raise ValueError(f"d-separation algorithm {dseparation_algo} is not supported")

    eligible_variables = (
-        get_descendants(graph, action_nodes) - set(outcome_nodes) - set(get_descendants(graph, outcome_nodes))
+        get_descendants(graph, action_nodes)
+        - set(action_nodes)


I added this line to remove the action nodes from the set of eligible nodes (and added a unit test which illustrates why this is needed). If there are multiple action nodes and one of them ends up in the list of eligible variables, we can run into a networkx error when calling the is_d_separator method, in which we are checking if one of the action nodes is a d-separator for the set of action nodes

nparent1 · 2025-01-03T17:54:08Z

tests/causal_identifiers/example_graphs.py

@@ -422,4 +483,15 @@
        valid_frontdoor_sets=[],
        invalid_frontdoor_sets=[{"Z"}, {"M1"}, {"M2"}, {"M1", "M2"}],
    ),
+    # This example is reproduced from the generalized_adjustment examples, and is


Here is the unit test I've added to illustrate why we need to filter out the action nodes from the set of eligible notes for the frontdoor criterion (without my change this test throws an error)

Signed-off-by: Nicholas Parente <[email protected]>

nparent1 added 6 commits December 29, 2024 23:09

first commit

0261ad0

Signed-off-by: Nicholas Parente <[email protected]>

adding default case

5f3bc5b

Signed-off-by: Nicholas Parente <[email protected]>

adding minimal test

ed55a3e

Signed-off-by: Nicholas Parente <[email protected]>

poe format

9fdf085

Signed-off-by: Nicholas Parente <[email protected]>

adding test, throwing on unsupported

f43f279

Signed-off-by: Nicholas Parente <[email protected]>

tweaks

d4faa80

Signed-off-by: Nicholas Parente <[email protected]>

nparent1 changed the title ~~[WIP] General Adjustment~~ Generalized Adjustment Criterion Dec 30, 2024

nparent1 mentioned this pull request Dec 30, 2024

Implement complete adjustment criterion that generalizes backdoor #402

Open

nparent1 changed the title ~~Generalized Adjustment Criterion~~ [FEA] Generalized Adjustment Criterion Dec 30, 2024

nparent1 added 2 commits December 30, 2024 16:14

dependency bump

cf66fca

Signed-off-by: Nicholas Parente <[email protected]>

delete misc files

f5b5bb0

Signed-off-by: Nicholas Parente <[email protected]>

nparent1 marked this pull request as draft December 30, 2024 23:15

nparent1 added 3 commits December 30, 2024 23:44

fix dictionary mapping

a6f6230

Signed-off-by: Nicholas Parente <[email protected]>

make test check python version

70dc39f

Signed-off-by: Nicholas Parente <[email protected]>

adding another test

ff50aba

Signed-off-by: Nicholas Parente <[email protected]>

nparent1 commented Dec 31, 2024

View reviewed changes

pyproject.toml Show resolved Hide resolved

nparent1 marked this pull request as ready for review December 31, 2024 05:38

amit-sharma self-requested a review December 31, 2024 07:50

amit-sharma reviewed Dec 31, 2024

View reviewed changes

adding docs

e275326

Signed-off-by: Nicholas Parente <[email protected]>

nparent1 force-pushed the np/gen_adjustment_two branch from 9972173 to e275326 Compare January 3, 2025 17:37

nparent1 added 2 commits January 3, 2025 12:41

restore notebooks I dont want to change

a93c614

Signed-off-by: Nicholas Parente <[email protected]>

remove extraneous comment

c6472ca

Signed-off-by: Nicholas Parente <[email protected]>

nparent1 commented Jan 3, 2025

View reviewed changes

nparent1 requested a review from amit-sharma January 4, 2025 02:04

remove comment and print statement from example notebook

b421d8b

Signed-off-by: Nicholas Parente <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[FEA] Generalized Adjustment Criterion #1292

[FEA] Generalized Adjustment Criterion #1292

nparent1 commented Dec 30, 2024 •

edited

Loading

amit-sharma left a comment

nparent1 commented Jan 3, 2025

nparent1 Jan 3, 2025

nparent1 Jan 3, 2025

[FEA] Generalized Adjustment Criterion #1292

Are you sure you want to change the base?

[FEA] Generalized Adjustment Criterion #1292

Conversation

nparent1 commented Dec 30, 2024 • edited Loading

References:

amit-sharma left a comment

Choose a reason for hiding this comment

nparent1 commented Jan 3, 2025

nparent1 Jan 3, 2025

Choose a reason for hiding this comment

nparent1 Jan 3, 2025

Choose a reason for hiding this comment

nparent1 commented Dec 30, 2024 •

edited

Loading