L2: DAGs and Confounding

class: center, middle, inverse, title-slide

.title[
# L2: DAGs and Confounding
]
.author[
### Jean Morrison
]
.institute[
### University of Michigan
]
.date[
### Lecture on 2025-01-14
(updated: 2025-01-22)
]

---

`$\newcommand{\ci}{\perp\!\!\!\perp}$`
`$\newcommand{\nci}{\not\!\perp\!\!\!\perp}$`

## Lecture Outline

1. Representing Causal Relationships in Directed, Acyclic Graphs (DAGs)
   + The causal Markov property connects DAGs to properties of the joint distribution of nodes.

1. Connecting DAGs to Counterfactuals through Structural Equation Models.

1. Using the Properties of DAGs to Identify Conditional Exchangeability.
  + d-separation allows us to determine conditional independence statements from DAGs.
  + The backdoor criterion allows us to determine conditional exchangeability from DAGs.
  + SWIGs provide an alternative approach for identifying conditional exchangeability.

---
# 1. Representing Causal Relationships in Directed, Acyclic Graphs (DAGs)

---

## Graphical Representations of Casual Effects

+ We can represent causal effects in a graph, with arrows.

+ Nodes in the graph are random variables.

+ Directed edges represent direct causal effects (not mediated by any other variables in the graph).

+ The absence of an edge indicates the absence of a direct causal effect. 
--

<div class="grViz html-widget html-fill-item" id="htmlwidget-693dd0f022d45f58cb24" style="width:90%;height:180px;"></div>
<script type="application/json" data-for="htmlwidget-693dd0f022d45f58cb24">{"x":{"diagram":"digraph {\n\ngraph [layout = \"neato\",\n       outputorder = \"edgesfirst\",\n       bgcolor = \"white\"]\n\nnode [fontname = \"Helvetica\",\n      fontsize = \"10\",\n      shape = \"circle\",\n      fixedsize = \"true\",\n      width = \"0.5\",\n      style = \"filled\",\n      fillcolor = \"aliceblue\",\n      color = \"gray70\",\n      fontcolor = \"gray50\"]\n\nedge [fontname = \"Helvetica\",\n     fontsize = \"8\",\n     len = \"1.5\",\n     color = \"gray80\",\n     arrowsize = \"0.5\"]\n\n  \"1\" [label = \"A\", fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", fontcolor = \"black\", color = \"black\", pos = \"0,0!\"] \n  \"2\" [label = \"Y\", fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", fontcolor = \"black\", color = \"black\", pos = \"1,0!\"] \n  \"3\" [label = \"A\", fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", fontcolor = \"black\", color = \"black\", pos = \"2.2,0!\"] \n  \"4\" [label = \"Y\", fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", fontcolor = \"black\", color = \"black\", pos = \"3.2,0!\"] \n  \"5\" [label = \"L\", fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", fontcolor = \"black\", color = \"black\", pos = \"2.7,0.5!\"] \n\"1\"->\"2\" [color = \"black\"] \n\"3\"->\"4\" [color = \"black\"] \n\"5\"->\"4\" [color = \"black\"] \n\"5\"->\"3\" [color = \"black\"] \n}","config":{"engine":"dot","options":null}},"evals":[],"jsHooks":[]}</script>

---

## Why Use Graphs

- Graphs are a natural way of encoding scientific understanding of the world.

- For many people, the graph encoding is fairly intuitive. 
  - This makes them a useful tool for communicating structural assumptions across domains.

- Under some assumptions, graphical properties can be used to easily solve problems that are hard to solve otherwise. 
  - In particular, graphs are very useful for answering the question "what variables should I condition on?".

---

## Graph Definitions

- A graph, `$\mathcal{G} = \lbrace V, E\rbrace$` consists of
  - A set of nodes (vertices) `$V = \lbrace V_1, \dots, V_J\rbrace$`
  - A set of edges `$E = \lbrace (V_{1_1}, V_{1_2}), \dots, (V_{K_1}, V_{K_2}) \rbrace$`, which can be represented as pairs of nodes.
  
- A graph can be either *directed*, in which case elements of `$E$` are ordered pairs or *undirected*, in which case
elements of `$E$` are un-ordered. 
  - Our graphs will almost always be directed.

- Two nodes are *adjacent* if they are connected by an edge. 
  + If the edge is directed, the node at the beginning of the edge is the *parent* and the node at the end is the *child*.

---

## Graph Definitions

- A *path* is a sequence of nodes connected by edges that does not intersect itself (a node cannot appear in a path twice).

- In a directed path, all of the edges are oriented in the same direction: i.e. each edge starts at last node of the previous edge.

- In this graph:

<div class="grViz html-widget html-fill-item" id="htmlwidget-ed1399174a6590cc0f35" style="width:40%;height:180px;"></div>
<script type="application/json" data-for="htmlwidget-ed1399174a6590cc0f35">{"x":{"diagram":"digraph {\n\ngraph [layout = \"neato\",\n       outputorder = \"edgesfirst\",\n       bgcolor = \"white\"]\n\nnode [fontname = \"Helvetica\",\n      fontsize = \"10\",\n      shape = \"circle\",\n      fixedsize = \"true\",\n      width = \"0.5\",\n      style = \"filled\",\n      fillcolor = \"aliceblue\",\n      color = \"gray70\",\n      fontcolor = \"gray50\"]\n\nedge [fontname = \"Helvetica\",\n     fontsize = \"8\",\n     len = \"1.5\",\n     color = \"gray80\",\n     arrowsize = \"0.5\"]\n\n  \"1\" [label = \"A\", fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"#000000\", pos = \"0,0!\"] \n  \"2\" [label = \"Y\", fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"#000000\", pos = \"1,0!\"] \n  \"3\" [label = \"L\", fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"#000000\", pos = \"0.5,0.5!\"] \n\"1\"->\"2\" [color = \"black\"] \n\"3\"->\"2\" [color = \"black\"] \n\"3\"->\"1\" [color = \"black\"] \n}","config":{"engine":"dot","options":null}},"evals":[],"jsHooks":[]}</script>
</center>

there are two paths from `$A$` to `$Y$` but only one directed path.

---

## Graph Definitions

- If there are no paths between two nodes, they are *disconnected* (or *connected* otherwise).

- Node `$k$` is a descendant of node `$j$` if there is a directed path from `$V_j$` to `$V_k$`.

- If a graph contains no directed cycles, it is *acyclic*
  + We will require all of our graphs to be acyclic.

- DAG = Directed Acyclic Graph
---

## Example:

+ Consider this story:

- There are two possible treatments for a disease. Treatment `$A = 1$` is more effective than `$A = 0$`, but has more side effects.
  - Doctors prefer treatment `$A = 0$` for patients who are older or who have more mild disease.
  - Patient outcome (remission or not) is affected by initial severity, treatment, and treatment adherence.
  - Conditional on everything else, age has no effect on patient outcome or disease severity.

+ Work with your neighbor to arrange the following variables in a DAG (there is more than one right answer):

- Patient outcome
  - Initial disease severity
  - Treatment
  - Treatment Adherence
  - Age

---
## Disease Treatment Example

<center>
<div class="grViz html-widget html-fill-item" id="htmlwidget-e100d7924ab511ca2804" style="width:90%;height:504px;"></div>
<script type="application/json" data-for="htmlwidget-e100d7924ab511ca2804">{"x":{"diagram":"digraph {\n\ngraph [layout = \"neato\",\n       outputorder = \"edgesfirst\",\n       bgcolor = \"white\"]\n\nnode [fontname = \"Helvetica\",\n      fontsize = \"10\",\n      shape = \"circle\",\n      fixedsize = \"true\",\n      width = \"0.5\",\n      style = \"filled\",\n      fillcolor = \"aliceblue\",\n      color = \"gray70\",\n      fontcolor = \"gray50\"]\n\nedge [fontname = \"Helvetica\",\n     fontsize = \"8\",\n     len = \"1.5\",\n     color = \"gray80\",\n     arrowsize = \"0.5\"]\n\n  \"1\" [label = \"Age\", fontname = \"Helvetica\", fontsize = \"12\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"black\", fixedsize = \"TRUE\", width = \"1\", pos = \"0.2,1.2!\"] \n  \"2\" [label = \"Severity\", fontname = \"Helvetica\", fontsize = \"12\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"black\", fixedsize = \"TRUE\", width = \"1\", pos = \"0.2,-1.2!\"] \n  \"3\" [label = \"Treatment\", fontname = \"Helvetica\", fontsize = \"12\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"black\", fixedsize = \"TRUE\", width = \"1\", pos = \"2,0!\"] \n  \"4\" [label = \"Outcome\", fontname = \"Helvetica\", fontsize = \"12\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"black\", fixedsize = \"TRUE\", width = \"1\", pos = \"4,0!\"] \n  \"5\" [label = \"Adherence\", fontname = \"Helvetica\", fontsize = \"12\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"black\", fixedsize = \"TRUE\", width = \"1\", pos = \"3,1.2!\"] \n\"1\"->\"3\" [color = \"black\"] \n\"2\"->\"3\" [color = \"black\"] \n\"2\"->\"4\" [color = \"black\"] \n\"3\"->\"4\" [color = \"black\"] \n\"5\"->\"4\" [color = \"black\"] \n}","config":{"engine":"dot","options":null}},"evals":[],"jsHooks":[]}</script>

</center>

---
## Temporality

- Our definition of causality requires that the exposure occur before the outcome in time.

- Under this restriction, a causal DAG must be consistent with at least one strict ordering of nodes.
  
--

- How do we represent a feedback loop?

--
  + We can create multiple nodes representing unique time points (e.g. `$A_1, A_2, \dots$`), with each node only permitted to have causal effects on future nodes.
  + More on this later.

<div class="grViz html-widget html-fill-item" id="htmlwidget-a0a15c4451ae1db027b1" style="width:90%;height:180px;"></div>
<script type="application/json" data-for="htmlwidget-a0a15c4451ae1db027b1">{"x":{"diagram":"digraph {\n\ngraph [layout = \"neato\",\n       outputorder = \"edgesfirst\",\n       bgcolor = \"white\"]\n\nnode [fontname = \"Helvetica\",\n      fontsize = \"10\",\n      shape = \"circle\",\n      fixedsize = \"true\",\n      width = \"0.5\",\n      style = \"filled\",\n      fillcolor = \"aliceblue\",\n      color = \"gray70\",\n      fontcolor = \"gray50\"]\n\nedge [fontname = \"Helvetica\",\n     fontsize = \"8\",\n     len = \"1.5\",\n     color = \"gray80\",\n     arrowsize = \"0.5\"]\n\n  \"1\" [label = \"A\", fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", fontcolor = \"black\", color = \"black\", pos = \"0,0!\"] \n  \"2\" [label = \"Y\", fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", fontcolor = \"black\", color = \"black\", pos = \"1,0!\"] \n  \"3\" [label = <A<FONT POINT-SIZE=\"8\"><SUB>1<\/SUB><\/FONT>>, fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", fontcolor = \"black\", color = \"black\", pos = \"2,0.3!\"] \n  \"4\" [label = <Y<FONT POINT-SIZE=\"8\"><SUB>1<\/SUB><\/FONT>>, fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", fontcolor = \"black\", color = \"black\", pos = \"2.5,-0.3!\"] \n  \"5\" [label = <A<FONT POINT-SIZE=\"8\"><SUB>2<\/SUB><\/FONT>>, fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", fontcolor = \"black\", color = \"black\", pos = \"3,0.3!\"] \n  \"6\" [label = <Y<FONT POINT-SIZE=\"8\"><SUB>2<\/SUB><\/FONT>>, fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", fontcolor = \"black\", color = \"black\", pos = \"3.5,-0.3!\"] \n\"1\"->\"2\" [color = \"black\"] \n\"2\"->\"1\" [color = \"black\"] \n\"3\"->\"4\" [color = \"black\"] \n\"4\"->\"5\" [color = \"black\"] \n\"4\"->\"6\" [color = \"black\"] \n\"5\"->\"6\" [color = \"black\"] \n}","config":{"engine":"dot","options":null}},"evals":[],"jsHooks":[]}</script>

</center>

---
## Causal Markov Property

- The causal Markov property translates graph structure into probability statements.

+ It states that, conditional on it's parents, each node is independent of all nodes that are not not it's descendants.

+ This implies that the joint probability distribution of all nodes can be factored as

$$
P(V) = \prod_{j = 1}^{J} P(V_j \vert pa_j).
$$

---

## Example

Conditional independence statements in the disease treatment graph:

`$$S\ci A \qquad S\ci Ad \qquad A \ci Ad$$`
`$$T \ci Ad\ \mid A, S$$`
<center>

<div class="grViz html-widget html-fill-item" id="htmlwidget-abcf19d6ee813008ac12" style="width:432px;height:360px;"></div>
<script type="application/json" data-for="htmlwidget-abcf19d6ee813008ac12">{"x":{"diagram":"digraph {\n\ngraph [layout = \"neato\",\n       outputorder = \"edgesfirst\",\n       bgcolor = \"white\"]\n\nnode [fontname = \"Helvetica\",\n      fontsize = \"10\",\n      shape = \"circle\",\n      fixedsize = \"true\",\n      width = \"0.5\",\n      style = \"filled\",\n      fillcolor = \"aliceblue\",\n      color = \"gray70\",\n      fontcolor = \"gray50\"]\n\nedge [fontname = \"Helvetica\",\n     fontsize = \"8\",\n     len = \"1.5\",\n     color = \"gray80\",\n     arrowsize = \"0.5\"]\n\n  \"1\" [label = \"Age\nA\", fontname = \"Helvetica\", fontsize = \"12\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"black\", fixedsize = \"TRUE\", width = \"1\", pos = \"0.2,1.2!\"] \n  \"2\" [label = \"Severity\nS\", fontname = \"Helvetica\", fontsize = \"12\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"black\", fixedsize = \"TRUE\", width = \"1\", pos = \"0.2,-1.2!\"] \n  \"3\" [label = \"Treatment\nT\", fontname = \"Helvetica\", fontsize = \"12\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"black\", fixedsize = \"TRUE\", width = \"1\", pos = \"2,0!\"] \n  \"4\" [label = \"Outcome\nO\", fontname = \"Helvetica\", fontsize = \"12\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"black\", fixedsize = \"TRUE\", width = \"1\", pos = \"4,0!\"] \n  \"5\" [label = \"Adherence\nAd\", fontname = \"Helvetica\", fontsize = \"12\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"black\", fixedsize = \"TRUE\", width = \"1\", pos = \"3,1.2!\"] \n\"1\"->\"3\" [color = \"black\"] \n\"2\"->\"3\" [color = \"black\"] \n\"2\"->\"4\" [color = \"black\"] \n\"3\"->\"4\" [color = \"black\"] \n\"5\"->\"4\" [color = \"black\"] \n}","config":{"engine":"dot","options":null}},"evals":[],"jsHooks":[]}</script>

</center>

---

## Example

In our example, we can factor the joint probability as

`$$P(A, S, T, Ad, O) = P(A)P(S)P(Ad)P(T \vert A, S)P(O \vert S, T, Ad)$$`

<div class="grViz html-widget html-fill-item" id="htmlwidget-f0f11049bf2527f65879" style="width:432px;height:360px;"></div>
<script type="application/json" data-for="htmlwidget-f0f11049bf2527f65879">{"x":{"diagram":"digraph {\n\ngraph [layout = \"neato\",\n       outputorder = \"edgesfirst\",\n       bgcolor = \"white\"]\n\nnode [fontname = \"Helvetica\",\n      fontsize = \"10\",\n      shape = \"circle\",\n      fixedsize = \"true\",\n      width = \"0.5\",\n      style = \"filled\",\n      fillcolor = \"aliceblue\",\n      color = \"gray70\",\n      fontcolor = \"gray50\"]\n\nedge [fontname = \"Helvetica\",\n     fontsize = \"8\",\n     len = \"1.5\",\n     color = \"gray80\",\n     arrowsize = \"0.5\"]\n\n  \"1\" [label = \"Age\nA\", fontname = \"Helvetica\", fontsize = \"12\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"black\", fixedsize = \"TRUE\", width = \"1\", pos = \"0.2,1.2!\"] \n  \"2\" [label = \"Severity\nS\", fontname = \"Helvetica\", fontsize = \"12\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"black\", fixedsize = \"TRUE\", width = \"1\", pos = \"0.2,-1.2!\"] \n  \"3\" [label = \"Treatment\nT\", fontname = \"Helvetica\", fontsize = \"12\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"black\", fixedsize = \"TRUE\", width = \"1\", pos = \"2,0!\"] \n  \"4\" [label = \"Outcome\nO\", fontname = \"Helvetica\", fontsize = \"12\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"black\", fixedsize = \"TRUE\", width = \"1\", pos = \"4,0!\"] \n  \"5\" [label = \"Adherence\nAd\", fontname = \"Helvetica\", fontsize = \"12\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"black\", fixedsize = \"TRUE\", width = \"1\", pos = \"3,1.2!\"] \n\"1\"->\"3\" [color = \"black\"] \n\"2\"->\"3\" [color = \"black\"] \n\"2\"->\"4\" [color = \"black\"] \n\"3\"->\"4\" [color = \"black\"] \n\"5\"->\"4\" [color = \"black\"] \n}","config":{"engine":"dot","options":null}},"evals":[],"jsHooks":[]}</script>

</center>
 
---

# 2. Connecting DAGs to Counterfactuals through Structural Equation Models

---

## Example

- Suppose we have a machine for measuring blood pressure.

- `$X$` represents the true systolic blood pressure and `$Y$` represents the measured systolic blood pressure.

- Suppose that the machine has some small amount of error.

- We can represent this system with a graph:

<center>
<div class="grViz html-widget html-fill-item" id="htmlwidget-38b71278e7ea78c5ee98" style="width:50%;height:180px;"></div>
<script type="application/json" data-for="htmlwidget-38b71278e7ea78c5ee98">{"x":{"diagram":"digraph {\n\ngraph [layout = \"neato\",\n       outputorder = \"edgesfirst\",\n       bgcolor = \"white\"]\n\nnode [fontname = \"Helvetica\",\n      fontsize = \"10\",\n      shape = \"circle\",\n      fixedsize = \"true\",\n      width = \"0.5\",\n      style = \"filled\",\n      fillcolor = \"aliceblue\",\n      color = \"gray70\",\n      fontcolor = \"gray50\"]\n\nedge [fontname = \"Helvetica\",\n     fontsize = \"8\",\n     len = \"1.5\",\n     color = \"gray80\",\n     arrowsize = \"0.5\"]\n\n  \"1\" [label = \"X\", fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", fontcolor = \"black\", color = \"black\", pos = \"0,0!\"] \n  \"2\" [label = \"Y\", fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", fontcolor = \"black\", color = \"black\", pos = \"1,0!\"] \n\"1\"->\"2\" [color = \"black\"] \n}","config":{"engine":"dot","options":null}},"evals":[],"jsHooks":[]}</script>
</center>
---

## Example

- We could also represent the system with an equation

`$$Y = X + \epsilon_Y$$`
where `$\epsilon_Y$` is a random variable that is the error of the machine.

- This equation is structural because the quantity on the left can be interpreted as a counterfactual.

- We could equivalently write

`$$Y(X = x) = x + \epsilon_Y$$`
- Traditionally, in SEM literature, the counterfactual is not written explicitly.

---

## Example

- Mathematically, `$Y = X + \epsilon_Y$` is equivalent to

`$$X = Y - \epsilon_Y$$`

- But this equation is not structural because the lefthand term cannot be interpreted as a counterfactual.

- In other words, intervening on the readout on the machine does not change the patients blood pressure.

---

## Structural Equation Models

- A **structural equation model** is a system of structural equations describing a set of variables.

- An equation is structural if the lefthand term can be interpreted as the counterfactual value, intervening on the terms on the right.

---

## SEMs Link to Graphs

- DAGs can be given causal interpretation by linking them with structural equation models.

- Let `$\mathcal{G} = \lbrace V, E \rbrace$` be a directed graph with nodes `$V_1, \dots, V_n$`.

- Let `$\epsilon_1, \dots, \epsilon_n$` be a set of random "noise" variables corresponding to each node.

- We assume that, for a given `$V_i$` with parents `$\mathbf{pa}_i \subset V$`, there is a counterfactual `$V_i(\mathbf{pa}_i)$` given by the non-parametric
structural equation 
`$$V_i(\mathbf{pa}_i) = f_{V_i}(\mathbf{pa}_i, \epsilon_{i})$$`

---

## Example

This graph

</center>

corresponds to the nonparametric SEM (NPSEM):

`$$Z = f_Z(\epsilon_Z)$$`
`$$M(z) = f_M(z, \epsilon_M)$$`
`$$Y(z, m) = f_Y(z, m, \epsilon_Y)$$`
---

## Special Case: Linear Structural Equation Models

- A linear SEM is the special case that `$f_{V_1}, \dots, f_{V_n}$` are linear and `$\epsilon_{1}, \dots, \epsilon_n$` are mutually independent.

- In the previous example, a linear SEM would be

$$
`\begin{split}
&Z = \epsilon_Z\\
&M = \beta_{ZM} Z + \epsilon_M\\
&Y = \beta_{ZY} Z + \beta_{MY} M + \epsilon_Y
\end{split}`
$$
- In the linear case, the SEM can be written in matrix notation

$$
\mathbf{V}(\mathbf{v}) = \mathbf{B}^{\top}\mathbf{v} + \boldsymbol{\epsilon}
$$

---

## Linear SEMs

- Lots of early work on causal inference deals specifically with linear SEMs.

- Linear SEMs are easy to work with, so it is sometimes convenient to demonstrate a property using linear SEMs.

- However, this is a very restrictive model.

- For now, we will try to make as few assumptions as possible.

- When we start the modeling section, we will add some assumptions so that we can estimate parameters but it is nice to be clear about which assumptions are necessary and which are just a convenience.

---

## Completing the Causal Model Definiton

- Our definition of NPSEMs is not quite sufficient to provide a causal model because it doesn't guarantee the causal Markov property.

- For this we need an assumption about `$\epsilon_1, \dots, \epsilon_n$`.

- One sufficient assumption is that `$\epsilon_1, \dots, \epsilon_n$` are mutually independent of each other and
the other variables in the model.

- Richardson and Robins call this the NPSEM-IE (IE = Independent Errors) model. They also propose a weaker set of assumptions that is also sufficient.

- We will come back to this later.

---

# 3. Using the Properties of DAGs to Identify Conditional Exchangeability.

3.1 Confounding, Colliding, and d-separation

3.2 Backdoor criterion

3.3 Single world intervention graphs (SWIGs)

---
# 3.1 Confounding, Colliding, and d-seperation

---

## Exchangeability

- We want to know if the outcome `$O$` is exchangeable with respect to the treatment? `$O(t) \ci T$`?

- If not, what set of variables, `$L$` can we condition on such that `$O(t) \ci T \midt L$`?

<div class="grViz html-widget html-fill-item" id="htmlwidget-d586ef9923832c093602" style="width:432px;height:360px;"></div>
<script type="application/json" data-for="htmlwidget-d586ef9923832c093602">{"x":{"diagram":"digraph {\n\ngraph [layout = \"neato\",\n       outputorder = \"edgesfirst\",\n       bgcolor = \"white\"]\n\nnode [fontname = \"Helvetica\",\n      fontsize = \"10\",\n      shape = \"circle\",\n      fixedsize = \"true\",\n      width = \"0.5\",\n      style = \"filled\",\n      fillcolor = \"aliceblue\",\n      color = \"gray70\",\n      fontcolor = \"gray50\"]\n\nedge [fontname = \"Helvetica\",\n     fontsize = \"8\",\n     len = \"1.5\",\n     color = \"gray80\",\n     arrowsize = \"0.5\"]\n\n  \"1\" [label = \"Age\nA\", fontname = \"Helvetica\", fontsize = \"12\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"black\", fixedsize = \"TRUE\", width = \"1\", pos = \"0.2,1.2!\"] \n  \"2\" [label = \"Severity\nS\", fontname = \"Helvetica\", fontsize = \"12\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"black\", fixedsize = \"TRUE\", width = \"1\", pos = \"0.2,-1.2!\"] \n  \"3\" [label = \"Treatment\nT\", fontname = \"Helvetica\", fontsize = \"12\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"black\", fixedsize = \"TRUE\", width = \"1\", pos = \"2,0!\"] \n  \"4\" [label = \"Outcome\nO\", fontname = \"Helvetica\", fontsize = \"12\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"black\", fixedsize = \"TRUE\", width = \"1\", pos = \"4,0!\"] \n  \"5\" [label = \"Adherence\nAd\", fontname = \"Helvetica\", fontsize = \"12\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"black\", fixedsize = \"TRUE\", width = \"1\", pos = \"3,1.2!\"] \n\"1\"->\"3\" [color = \"black\"] \n\"2\"->\"3\" [color = \"black\"] \n\"2\"->\"4\" [color = \"black\"] \n\"3\"->\"4\" [color = \"black\"] \n\"5\"->\"4\" [color = \"black\"] \n}","config":{"engine":"dot","options":null}},"evals":[],"jsHooks":[]}</script>

</center>

---

## Recognizing Lack of Exchangeability in a DAG

- Informally, there are two sources of lack of exchangeability:

+ The presence of common causes (confounders) that have not been conditioned on.
  + Common effects (colliders) that have been conditioned on. 
  
- We will see how to formalize these statements and how to use a DAG to identify a sufficient conditioning set to remove confounding.

---

## Common Causes (Confounders)

- The presence of a common cause introduces association between two variables that is not due to a causal effect.

<div class="grViz html-widget html-fill-item" id="htmlwidget-3225c05547a0fb163549" style="width:432px;height:180px;"></div>
<script type="application/json" data-for="htmlwidget-3225c05547a0fb163549">{"x":{"diagram":"digraph {\n\ngraph [layout = \"neato\",\n       outputorder = \"edgesfirst\",\n       bgcolor = \"white\"]\n\nnode [fontname = \"Helvetica\",\n      fontsize = \"10\",\n      shape = \"circle\",\n      fixedsize = \"true\",\n      width = \"0.5\",\n      style = \"filled\",\n      fillcolor = \"aliceblue\",\n      color = \"gray70\",\n      fontcolor = \"gray50\"]\n\nedge [fontname = \"Helvetica\",\n     fontsize = \"8\",\n     len = \"1.5\",\n     color = \"gray80\",\n     arrowsize = \"0.5\"]\n\n  \"1\" [label = \"A\", fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"#000000\", pos = \"0,0!\"] \n  \"2\" [label = \"Y\", fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"#000000\", pos = \"1,0!\"] \n  \"3\" [label = \"L\", fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"#000000\", pos = \"0.5,0.5!\"] \n\"3\"->\"2\" [color = \"black\"] \n\"3\"->\"1\" [color = \"black\"] \n}","config":{"engine":"dot","options":null}},"evals":[],"jsHooks":[]}</script>

</center>

- In the disease treatment example, disease severity is a confounder:
  - Sicker patients are more likely to receive `$A = 0$` and sicker patients are also more likely to have a poor outcome.
  - So there would be an association between treatment and outcome, even if the two drugs worked equally well. 
  
---

## Confounding

- Confounding as a concept is quite old and therefore has been given many definitions.

- We will define confounding as the lack of exchangeability that results from common causes. 
  
- *Confounders* are variables which can be used to adjust for confounding. 
  + In the graph below, `$L_1$` and `$L_2$` are both confounders, even though only `$L_1$` is a common cause of `$A$` and `$Y$`.
  
<center>

<div class="grViz html-widget html-fill-item" id="htmlwidget-e178c39190bf715c4fca" style="width:432px;height:180px;"></div>
<script type="application/json" data-for="htmlwidget-e178c39190bf715c4fca">{"x":{"diagram":"digraph {\n\ngraph [layout = \"neato\",\n       outputorder = \"edgesfirst\",\n       bgcolor = \"white\"]\n\nnode [fontname = \"Helvetica\",\n      fontsize = \"10\",\n      shape = \"circle\",\n      fixedsize = \"true\",\n      width = \"0.5\",\n      style = \"filled\",\n      fillcolor = \"aliceblue\",\n      color = \"gray70\",\n      fontcolor = \"gray50\"]\n\nedge [fontname = \"Helvetica\",\n     fontsize = \"8\",\n     len = \"1.5\",\n     color = \"gray80\",\n     arrowsize = \"0.5\"]\n\n  \"1\" [label = \"A\", fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"#000000\", pos = \"0,0!\"] \n  \"2\" [label = \"Y\", fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"#000000\", pos = \"1.5,0!\"] \n  \"3\" [label = <L<FONT POINT-SIZE=\"8\"><SUB>1<\/SUB><\/FONT>>, fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"#000000\", pos = \"0.5,0.8!\"] \n  \"4\" [label = <L<FONT POINT-SIZE=\"8\"><SUB>2<\/SUB><\/FONT>>, fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"#000000\", pos = \"1,0.4!\"] \n\"3\"->\"1\" [color = \"black\"] \n\"3\"->\"4\" [color = \"black\"] \n\"4\"->\"2\" [color = \"black\"] \n}","config":{"engine":"dot","options":null}},"evals":[],"jsHooks":[]}</script>

</center>
---

## Common Effects (Colliders)

+ A variable `$L$` is a collider relative to `$A$` and `$Y$` if `$L$` is a descendant of both `$A$` and `$Y.$`

+ The presence of a collider that is not conditioned on does not induce association between `$A$` and `$Y$`.

+ But **conditioning on a collider** introduces an association between `$A$` and `$Y$`. 
  - This is not necessarily as intuitive as the bias introduced by a common cause. 
  
  
<center>

<div class="grViz html-widget html-fill-item" id="htmlwidget-44ee4e8a22ab254c8800" style="width:40%;height:180px;"></div>
<script type="application/json" data-for="htmlwidget-44ee4e8a22ab254c8800">{"x":{"diagram":"digraph {\n\ngraph [layout = \"neato\",\n       outputorder = \"edgesfirst\",\n       bgcolor = \"white\"]\n\nnode [fontname = \"Helvetica\",\n      fontsize = \"10\",\n      shape = \"circle\",\n      fixedsize = \"true\",\n      width = \"0.5\",\n      style = \"filled\",\n      fillcolor = \"aliceblue\",\n      color = \"gray70\",\n      fontcolor = \"gray50\"]\n\nedge [fontname = \"Helvetica\",\n     fontsize = \"8\",\n     len = \"1.5\",\n     color = \"gray80\",\n     arrowsize = \"0.5\"]\n\n  \"1\" [label = \"A\", fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"#000000\", pos = \"0,0!\"] \n  \"2\" [label = \"Y\", fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"#000000\", pos = \"1,0!\"] \n  \"3\" [label = \"L\", fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"#000000\", pos = \"0.5,-0.5!\"] \n\"2\"->\"3\" [color = \"black\"] \n\"1\"->\"3\" [color = \"black\"] \n}","config":{"engine":"dot","options":null}},"evals":[],"jsHooks":[]}</script>

</center>

---

## Collider Example: Routes to Stardom

+ Suppose that in order to become a movie star, one must either be talented or beautiful.

+ Suppose that in the population, talent and beauty are uncorrelated.

+ But both talent and beauty increase a person's chance of becoming a star.

+ Then among those who are stars, beauty and talent will be negatively correlated.

<center>
<div class="grViz html-widget html-fill-item" id="htmlwidget-77ff65b2b3c2e91dcefe" style="width:504px;height:180px;"></div>
<script type="application/json" data-for="htmlwidget-77ff65b2b3c2e91dcefe">{"x":{"diagram":"digraph {\n\ngraph [layout = \"neato\",\n       outputorder = \"edgesfirst\",\n       bgcolor = \"white\"]\n\nnode [fontname = \"Helvetica\",\n      fontsize = \"10\",\n      shape = \"circle\",\n      fixedsize = \"true\",\n      width = \"0.5\",\n      style = \"filled\",\n      fillcolor = \"aliceblue\",\n      color = \"gray70\",\n      fontcolor = \"gray50\"]\n\nedge [fontname = \"Helvetica\",\n     fontsize = \"8\",\n     len = \"1.5\",\n     color = \"gray80\",\n     arrowsize = \"0.5\"]\n\n  \"1\" [label = \"Talent\", fontname = \"Helvetica\", fontsize = \"10\", width = \"0.5\", fillcolor = \"#FFFFFF\", color = \"black\", shape = \"ellipse\", fixedsize = \"FALSE\", fontcolor = \"#000000\", pos = \"0,0!\"] \n  \"2\" [label = \"Beauty\", fontname = \"Helvetica\", fontsize = \"10\", width = \"0.5\", fillcolor = \"#FFFFFF\", color = \"black\", shape = \"ellipse\", fixedsize = \"FALSE\", fontcolor = \"#000000\", pos = \"1,0!\"] \n  \"3\" [label = \"Stardom\", fontname = \"Helvetica\", fontsize = \"10\", width = \"0.5\", fillcolor = \"#FFFFFF\", color = \"black\", shape = \"ellipse\", fixedsize = \"FALSE\", fontcolor = \"#000000\", pos = \"0.5,-0.75!\"] \n\"2\"->\"3\" [color = \"black\"] \n\"1\"->\"3\" [color = \"black\"] \n}","config":{"engine":"dot","options":null}},"evals":[],"jsHooks":[]}</script>

</center>
---

## Collider Example: Routes to Stardom

---

## Collider Example: Routes to Stardom

---

## Colliders Can "Block" Confounding

- In the graph below, there is no confounding because there is no common cause of `$A$` and `$Y$`.

- The collider `$L_2$` is "blocking" the path from `$A$` to `$Y$`.

- Causal Markov properties show us that `$A$` and `$Y$` are independent in this graph.

- d-Separation formalizes the rules for identifying pairs of independent variables based on graphical rules.

<center>
<div class="grViz html-widget html-fill-item" id="htmlwidget-3031959dbe642fc910c3" style="width:432px;height:180px;"></div>
<script type="application/json" data-for="htmlwidget-3031959dbe642fc910c3">{"x":{"diagram":"digraph {\n\ngraph [layout = \"neato\",\n       outputorder = \"edgesfirst\",\n       bgcolor = \"white\"]\n\nnode [fontname = \"Helvetica\",\n      fontsize = \"10\",\n      shape = \"circle\",\n      fixedsize = \"true\",\n      width = \"0.5\",\n      style = \"filled\",\n      fillcolor = \"aliceblue\",\n      color = \"gray70\",\n      fontcolor = \"gray50\"]\n\nedge [fontname = \"Helvetica\",\n     fontsize = \"8\",\n     len = \"1.5\",\n     color = \"gray80\",\n     arrowsize = \"0.5\"]\n\n  \"1\" [label = \"A\", fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"#000000\", pos = \"0,0!\"] \n  \"2\" [label = \"Y\", fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"#000000\", pos = \"1.5,0!\"] \n  \"3\" [label = <L<FONT POINT-SIZE=\"8\"><SUB>1<\/SUB><\/FONT>>, fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"#000000\", pos = \"0.5,0.8!\"] \n  \"4\" [label = <L<FONT POINT-SIZE=\"8\"><SUB>2<\/SUB><\/FONT>>, fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"#000000\", pos = \"1,0.4!\"] \n\"3\"->\"1\" [color = \"black\"] \n\"3\"->\"4\" [color = \"black\"] \n\"2\"->\"4\" [color = \"black\"] \n}","config":{"engine":"dot","options":null}},"evals":[],"jsHooks":[]}</script>
</center>
---

## No Statistical Definition of Confounding

- One commonly given characterization of a confounder is a variable which

+ Is associated with the exposure.
  + Is associated with the outcome.
  + Is not on the pathway of interest between exposure and outcome. 
  
--

- Note that `$L_2$` satisfies all of these criteria but is not a confounder.

<div class="grViz html-widget html-fill-item" id="htmlwidget-ea970d89ee6f2d10228e" style="width:432px;height:180px;"></div>
<script type="application/json" data-for="htmlwidget-ea970d89ee6f2d10228e">{"x":{"diagram":"digraph {\n\ngraph [layout = \"neato\",\n       outputorder = \"edgesfirst\",\n       bgcolor = \"white\"]\n\nnode [fontname = \"Helvetica\",\n      fontsize = \"10\",\n      shape = \"circle\",\n      fixedsize = \"true\",\n      width = \"0.5\",\n      style = \"filled\",\n      fillcolor = \"aliceblue\",\n      color = \"gray70\",\n      fontcolor = \"gray50\"]\n\nedge [fontname = \"Helvetica\",\n     fontsize = \"8\",\n     len = \"1.5\",\n     color = \"gray80\",\n     arrowsize = \"0.5\"]\n\n  \"1\" [label = \"A\", fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"#000000\", pos = \"0,0!\"] \n  \"2\" [label = \"Y\", fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"#000000\", pos = \"1.5,0!\"] \n  \"3\" [label = <L<FONT POINT-SIZE=\"8\"><SUB>1<\/SUB><\/FONT>>, fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"#000000\", pos = \"0.5,0.8!\"] \n  \"4\" [label = <L<FONT POINT-SIZE=\"8\"><SUB>2<\/SUB><\/FONT>>, fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"#000000\", pos = \"1,0.4!\"] \n\"3\"->\"1\" [color = \"black\"] \n\"3\"->\"4\" [color = \"black\"] \n\"2\"->\"4\" [color = \"black\"] \n}","config":{"engine":"dot","options":null}},"evals":[],"jsHooks":[]}</script>

</center>

- Determining confounding requires a causal model. 
  + The data cannot tell you if confounding is present.
  
---

## d-Separation

+ A path is *blocked* if:
  1. Two arrowheads on the path collide ( `$\rightarrow W \leftarrow$` ) at a variable that is not being conditioned on *and* which has no descendants in the conditioning set. OR
  1. It contains a non-collider that is being conditioned on. 
  
+ A path is *open* if it is not blocked:
  - It does not contain a collider and no variables on the path are being conditioned on. OR
  - All colliders are conditioned on and no non-colliders are conditioned on.
  
+ Two variables are *d-separated* if all paths between them are blocked.

---

## Examples:

Are `$A$` and `$Y$` d-separated?

<center>
.pull-left[
<div class="grViz html-widget html-fill-item" id="htmlwidget-4f6e551d1bf8a771af3a" style="width:80%;height:180px;"></div>
<script type="application/json" data-for="htmlwidget-4f6e551d1bf8a771af3a">{"x":{"diagram":"digraph {\n\ngraph [layout = \"neato\",\n       outputorder = \"edgesfirst\",\n       bgcolor = \"white\"]\n\nnode [fontname = \"Helvetica\",\n      fontsize = \"10\",\n      shape = \"circle\",\n      fixedsize = \"true\",\n      width = \"0.5\",\n      style = \"filled\",\n      fillcolor = \"aliceblue\",\n      color = \"gray70\",\n      fontcolor = \"gray50\"]\n\nedge [fontname = \"Helvetica\",\n     fontsize = \"8\",\n     len = \"1.5\",\n     color = \"gray80\",\n     arrowsize = \"0.5\"]\n\n  \"1\" [label = \"A\", fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"#000000\", pos = \"0,0!\"] \n  \"2\" [label = \"Y\", fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"#000000\", pos = \"1.5,0!\"] \n  \"3\" [label = <L<FONT POINT-SIZE=\"8\"><SUB>1<\/SUB><\/FONT>>, fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"#000000\", pos = \"0.5,0.8!\"] \n  \"4\" [label = <L<FONT POINT-SIZE=\"8\"><SUB>2<\/SUB><\/FONT>>, fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"#000000\", pos = \"1,0.4!\"] \n\"3\"->\"1\" [color = \"black\"] \n\"3\"->\"4\" [color = \"black\"] \n\"4\"->\"2\" [color = \"black\"] \n}","config":{"engine":"dot","options":null}},"evals":[],"jsHooks":[]}</script>
]
.pull-right[
<div class="grViz html-widget html-fill-item" id="htmlwidget-671ce305f9a8685c8901" style="width:80%;height:180px;"></div>
<script type="application/json" data-for="htmlwidget-671ce305f9a8685c8901">{"x":{"diagram":"digraph {\n\ngraph [layout = \"neato\",\n       outputorder = \"edgesfirst\",\n       bgcolor = \"white\"]\n\nnode [fontname = \"Helvetica\",\n      fontsize = \"10\",\n      shape = \"circle\",\n      fixedsize = \"true\",\n      width = \"0.5\",\n      style = \"filled\",\n      fillcolor = \"aliceblue\",\n      color = \"gray70\",\n      fontcolor = \"gray50\"]\n\nedge [fontname = \"Helvetica\",\n     fontsize = \"8\",\n     len = \"1.5\",\n     color = \"gray80\",\n     arrowsize = \"0.5\"]\n\n  \"1\" [label = \"A\", fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"#000000\", pos = \"0,0!\"] \n  \"2\" [label = \"Y\", fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"#000000\", pos = \"1.5,0!\"] \n  \"3\" [label = <L<FONT POINT-SIZE=\"8\"><SUB>1<\/SUB><\/FONT>>, fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"#000000\", pos = \"0.5,0.8!\"] \n  \"4\" [label = <L<FONT POINT-SIZE=\"8\"><SUB>2<\/SUB><\/FONT>>, fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"#000000\", pos = \"1,0.4!\"] \n\"3\"->\"1\" [color = \"black\"] \n\"3\"->\"4\" [color = \"black\"] \n\"2\"->\"4\" [color = \"black\"] \n}","config":{"engine":"dot","options":null}},"evals":[],"jsHooks":[]}</script>
]

<div class="grViz html-widget html-fill-item" id="htmlwidget-92cd458512c088aea9cd" style="width:40%;height:180px;"></div>
<script type="application/json" data-for="htmlwidget-92cd458512c088aea9cd">{"x":{"diagram":"digraph {\n\ngraph [layout = \"neato\",\n       outputorder = \"edgesfirst\",\n       bgcolor = \"white\"]\n\nnode [fontname = \"Helvetica\",\n      fontsize = \"10\",\n      shape = \"circle\",\n      fixedsize = \"true\",\n      width = \"0.5\",\n      style = \"filled\",\n      fillcolor = \"aliceblue\",\n      color = \"gray70\",\n      fontcolor = \"gray50\"]\n\nedge [fontname = \"Helvetica\",\n     fontsize = \"8\",\n     len = \"1.5\",\n     color = \"gray80\",\n     arrowsize = \"0.5\"]\n\n  \"1\" [label = \"A\", fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", shape = \"circle\", fontcolor = \"#000000\", pos = \"0,0!\"] \n  \"2\" [label = \"Y\", fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", shape = \"circle\", fontcolor = \"#000000\", pos = \"1.5,0!\"] \n  \"3\" [label = <L<FONT POINT-SIZE=\"8\"><SUB>1<\/SUB><\/FONT>>, fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", shape = \"circle\", fontcolor = \"#000000\", pos = \"0.5,0.8!\"] \n  \"4\" [label = <L<FONT POINT-SIZE=\"8\"><SUB>2<\/SUB><\/FONT>>, fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", shape = \"square\", fontcolor = \"#000000\", pos = \"1,0.4!\"] \n\"3\"->\"1\" [color = \"black\"] \n\"3\"->\"4\" [color = \"black\"] \n\"2\"->\"4\" [color = \"black\"] \n}","config":{"engine":"dot","options":null}},"evals":[],"jsHooks":[]}</script>

</center>

square = is conditioned on 
---

## d-Separation Implies Conditional Independence

- Let `$A$`, `$B$`, and `$C$` be sets of variables. Verma and Pearl (1988) proved that
$$ A \text{ is } d\text{-separated from }B\text{ given }C \Rightarrow A \ci B \mid C$$

- We will not prove this in class, but you may enjoy the proof in the 1988 paper if you like graph theory (see Related Reading).

- Notice that `$d$`-separation is a result about conditional independence of the variables **in** the graph.

- It does not tell us about conditional exchangeability, which is conditional independence of a counterfactual value and a variable in the graph, e.g. `$Y(A=a) \ci A \mid L$`. 
  - So far there are no counterfactuals in our graphs. 
  
---
## Faithfulness

- Faithfulness is the reverse direction.  A graph is faithful if, for any three sets of variables `$A$`, `$B$`, and `$C$`, 
`$$A \ci B \mid C \Rightarrow\ A \text{ is } d\text{-separated from }B\text{ given }C$$`

- Violations of faithfulness occur when confounding effects perfectly cancel each other.

---
## Faithfulness Example

- If the graph below corresponds to the linear SEM
$$
`\begin{split}
&A = 0.4 U_1 - 0.2 U_2 + \epsilon_A \qquad &U_1 = \epsilon_{U_1}\\
&B = 0.5 U_1 + U_2 + \epsilon_B\qquad &U_2 = \epsilon_{U_2}\\
&\epsilon_A \ci \epsilon_B \ci \epsilon_{U_1} \ci \epsilon_{U_2} \qquad &\epsilon_{*} \sim N(0, 1),
\end{split}`
$$
then `$A \ci B$`.

- But `$A$` and `$B$` are not `$d$`-separated unconditionally in this graph.

<center>
<div class="grViz html-widget html-fill-item" id="htmlwidget-22f3f0808b01ce14856d" style="width:432px;height:180px;"></div>
<script type="application/json" data-for="htmlwidget-22f3f0808b01ce14856d">{"x":{"diagram":"digraph {\n\ngraph [layout = \"neato\",\n       outputorder = \"edgesfirst\",\n       bgcolor = \"white\"]\n\nnode [fontname = \"Helvetica\",\n      fontsize = \"10\",\n      shape = \"circle\",\n      fixedsize = \"true\",\n      width = \"0.5\",\n      style = \"filled\",\n      fillcolor = \"aliceblue\",\n      color = \"gray70\",\n      fontcolor = \"gray50\"]\n\nedge [fontname = \"Helvetica\",\n     fontsize = \"8\",\n     len = \"1.5\",\n     color = \"gray80\",\n     arrowsize = \"0.5\"]\n\n  \"1\" [label = \"A\", fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"#000000\", pos = \"0,0!\"] \n  \"2\" [label = \"B\", fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"#000000\", pos = \"2,0!\"] \n  \"3\" [label = <U<FONT POINT-SIZE=\"8\"><SUB>1<\/SUB><\/FONT>>, fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"#000000\", pos = \"1,0.5!\"] \n  \"4\" [label = <U<FONT POINT-SIZE=\"8\"><SUB>2<\/SUB><\/FONT>>, fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"#000000\", pos = \"1,-0.5!\"] \n\"3\"->\"1\" [color = \"black\"] \n\"3\"->\"2\" [color = \"black\"] \n\"4\"->\"1\" [color = \"black\"] \n\"4\"->\"2\" [color = \"black\"] \n}","config":{"engine":"dot","options":null}},"evals":[],"jsHooks":[]}</script>
</center>

---

## Faithfulness

- Pearl calls violations of faithfulness "incidental cancellations" because conditional independence only occurs when there are specific numerical relationships between the variables.

- There are many alternative SEMs corresponding to the same graph below in which `$A$` is not independent of `$B$` unconditionally.

- Pearl defines "stable" vs "unstable" unbiasedness. Unstable unbiasedness occurs when faithfulness is violated. 
  
<center>
<div class="grViz html-widget html-fill-item" id="htmlwidget-0410aaadc15ed7e2f917" style="width:432px;height:180px;"></div>
<script type="application/json" data-for="htmlwidget-0410aaadc15ed7e2f917">{"x":{"diagram":"digraph {\n\ngraph [layout = \"neato\",\n       outputorder = \"edgesfirst\",\n       bgcolor = \"white\"]\n\nnode [fontname = \"Helvetica\",\n      fontsize = \"10\",\n      shape = \"circle\",\n      fixedsize = \"true\",\n      width = \"0.5\",\n      style = \"filled\",\n      fillcolor = \"aliceblue\",\n      color = \"gray70\",\n      fontcolor = \"gray50\"]\n\nedge [fontname = \"Helvetica\",\n     fontsize = \"8\",\n     len = \"1.5\",\n     color = \"gray80\",\n     arrowsize = \"0.5\"]\n\n  \"1\" [label = \"A\", fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"#000000\", pos = \"0,0!\"] \n  \"2\" [label = \"B\", fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"#000000\", pos = \"2,0!\"] \n  \"3\" [label = <U<FONT POINT-SIZE=\"8\"><SUB>1<\/SUB><\/FONT>>, fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"#000000\", pos = \"1,0.5!\"] \n  \"4\" [label = <U<FONT POINT-SIZE=\"8\"><SUB>2<\/SUB><\/FONT>>, fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"#000000\", pos = \"1,-0.5!\"] \n\"3\"->\"1\" [color = \"black\"] \n\"3\"->\"2\" [color = \"black\"] \n\"4\"->\"1\" [color = \"black\"] \n\"4\"->\"2\" [color = \"black\"] \n}","config":{"engine":"dot","options":null}},"evals":[],"jsHooks":[]}</script>
</center>  
  
  
  
  
---
## Faithfulness

- Practically, we can assume that faithfulness is never violated except when it is violated by design.

- Matching studies intentionally violate faithfulness. 
  + More on this in our matching lecture.

---

## Example

Which pairs of variables are d-Separated unconditionally?

<center>
<div class="grViz html-widget html-fill-item" id="htmlwidget-726f84b11166e854ceea" style="width:432px;height:360px;"></div>
<script type="application/json" data-for="htmlwidget-726f84b11166e854ceea">{"x":{"diagram":"digraph {\n\ngraph [layout = \"neato\",\n       outputorder = \"edgesfirst\",\n       bgcolor = \"white\"]\n\nnode [fontname = \"Helvetica\",\n      fontsize = \"10\",\n      shape = \"circle\",\n      fixedsize = \"true\",\n      width = \"0.5\",\n      style = \"filled\",\n      fillcolor = \"aliceblue\",\n      color = \"gray70\",\n      fontcolor = \"gray50\"]\n\nedge [fontname = \"Helvetica\",\n     fontsize = \"8\",\n     len = \"1.5\",\n     color = \"gray80\",\n     arrowsize = \"0.5\"]\n\n  \"1\" [label = \"Age\nA\", fontname = \"Helvetica\", fontsize = \"12\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"black\", fixedsize = \"TRUE\", width = \"1\", pos = \"0.2,1.2!\"] \n  \"2\" [label = \"Severity\nS\", fontname = \"Helvetica\", fontsize = \"12\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"black\", fixedsize = \"TRUE\", width = \"1\", pos = \"0.2,-1.2!\"] \n  \"3\" [label = \"Treatment\nT\", fontname = \"Helvetica\", fontsize = \"12\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"black\", fixedsize = \"TRUE\", width = \"1\", pos = \"2,0!\"] \n  \"4\" [label = \"Outcome\nO\", fontname = \"Helvetica\", fontsize = \"12\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"black\", fixedsize = \"TRUE\", width = \"1\", pos = \"4,0!\"] \n  \"5\" [label = \"Adherence\nAd\", fontname = \"Helvetica\", fontsize = \"12\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"black\", fixedsize = \"TRUE\", width = \"1\", pos = \"3,1.2!\"] \n\"1\"->\"3\" [color = \"black\"] \n\"2\"->\"3\" [color = \"black\"] \n\"2\"->\"4\" [color = \"black\"] \n\"3\"->\"4\" [color = \"black\"] \n\"5\"->\"4\" [color = \"black\"] \n}","config":{"engine":"dot","options":null}},"evals":[],"jsHooks":[]}</script>
</center>

- The causal Markov property allowed us to conclude that `$T \ci Ad \mid S, A$`.
- Because `$T$` and `$Ad$` are d-separated, we can also conclude that `$T \ci Ad$` unconditionally.

---
# 3.2 The Backdoor Criterion

---

## Backdoor Path

- A backdoor path from `$A$` to `$Y$` is a path from `$A$` to `$Y$` that begins with an edge going *into* `$A$`.

- Find the backdoor paths from `$A$` to `$Y$`.

<div class="grViz html-widget html-fill-item" id="htmlwidget-65c9bccef14538ecaf28" style="width:60%;height:180px;"></div>
<script type="application/json" data-for="htmlwidget-65c9bccef14538ecaf28">{"x":{"diagram":"digraph {\n\ngraph [layout = \"neato\",\n       outputorder = \"edgesfirst\",\n       bgcolor = \"white\"]\n\nnode [fontname = \"Helvetica\",\n      fontsize = \"10\",\n      shape = \"circle\",\n      fixedsize = \"true\",\n      width = \"0.5\",\n      style = \"filled\",\n      fillcolor = \"aliceblue\",\n      color = \"gray70\",\n      fontcolor = \"gray50\"]\n\nedge [fontname = \"Helvetica\",\n     fontsize = \"8\",\n     len = \"1.5\",\n     color = \"gray80\",\n     arrowsize = \"0.5\"]\n\n  \"1\" [label = \"A\", fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"#000000\", pos = \"0,0!\"] \n  \"2\" [label = \"Y\", fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"#000000\", pos = \"1,0!\"] \n  \"3\" [label = \"L\", fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"#000000\", pos = \"0.5,0.5!\"] \n  \"4\" [label = \"A\", fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"#000000\", pos = \"1.5,0!\"] \n  \"5\" [label = \"Y\", fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"#000000\", pos = \"2.5,0!\"] \n  \"6\" [label = \"L\", fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"#000000\", pos = \"2,0.5!\"] \n\"1\"->\"2\" [color = \"black\"] \n\"3\"->\"2\" [color = \"black\"] \n\"3\"->\"1\" [color = \"black\"] \n\"4\"->\"5\" [color = \"black\"] \n\"6\"->\"5\" [color = \"black\"] \n\"4\"->\"6\" [color = \"black\"] \n}","config":{"engine":"dot","options":null}},"evals":[],"jsHooks":[]}</script>

<div class="grViz html-widget html-fill-item" id="htmlwidget-95cb59eb01e49c5280bc" style="width:60%;height:180px;"></div>
<script type="application/json" data-for="htmlwidget-95cb59eb01e49c5280bc">{"x":{"diagram":"digraph {\n\ngraph [layout = \"neato\",\n       outputorder = \"edgesfirst\",\n       bgcolor = \"white\"]\n\nnode [fontname = \"Helvetica\",\n      fontsize = \"10\",\n      shape = \"circle\",\n      fixedsize = \"true\",\n      width = \"0.5\",\n      style = \"filled\",\n      fillcolor = \"aliceblue\",\n      color = \"gray70\",\n      fontcolor = \"gray50\"]\n\nedge [fontname = \"Helvetica\",\n     fontsize = \"8\",\n     len = \"1.5\",\n     color = \"gray80\",\n     arrowsize = \"0.5\"]\n\n  \"1\" [label = \"A\", fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"#000000\", pos = \"0,0!\"] \n  \"2\" [label = \"Y\", fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"#000000\", pos = \"1.8,0!\"] \n  \"3\" [label = \"L\", fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"#000000\", pos = \"0.9,0.4!\"] \n  \"4\" [label = <U<FONT POINT-SIZE=\"8\"><SUB>1<\/SUB><\/FONT>>, fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"#000000\", pos = \"0.45,0.9!\"] \n  \"5\" [label = <U<FONT POINT-SIZE=\"8\"><SUB>2<\/SUB><\/FONT>>, fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"#000000\", pos = \"1.35,0.9!\"] \n\"4\"->\"1\" [color = \"black\"] \n\"4\"->\"3\" [color = \"black\"] \n\"5\"->\"3\" [color = \"black\"] \n\"5\"->\"2\" [color = \"black\"] \n\"1\"->\"2\" [color = \"black\"] \n\"3\"->\"2\" [color = \"black\"] \n}","config":{"engine":"dot","options":null}},"evals":[],"jsHooks":[]}</script>

</center>

---

## Backdoor Criterion and Exchangeability

Theorem: If a set of variables, `$L$`, 
  + blocks every backdoor path between `$A$` and `$Y$` 
  + contains no descendants of `$A$`,
  
then `$Y(a) \ci A \mid L$`.

--
- The two conditions in the theorem are referred to as the *backdoor criterion*.

--
- Reminder: We care about conditional exchangeability because we learned in L1 that if `$Y(a) \ci A \mid L$`, and consistency, SUTVA, and positivity hold, then 
`$$E[Y(a)] = \sum_{l}E[Y \vert A = a, L = l]P[L = l]$$`
- If we can find a measured set of variables, `$L$`, such that `$Y(a) \ci A \mid L$` and the other assumptions hold, then we know that `$E[Y(a)]$` is identifiable.

---

## Examples

Find a set of variables `$H \subseteq \left \lbrace U_1, L, U_2 \right \rbrace$` such that  `$Y(a) \ci A \mid H$`:

<div class="grViz html-widget html-fill-item" id="htmlwidget-05eb136db32eaea9f05a" style="width:60%;height:216px;"></div>
<script type="application/json" data-for="htmlwidget-05eb136db32eaea9f05a">{"x":{"diagram":"digraph {\n\ngraph [layout = \"neato\",\n       outputorder = \"edgesfirst\",\n       bgcolor = \"white\"]\n\nnode [fontname = \"Helvetica\",\n      fontsize = \"10\",\n      shape = \"circle\",\n      fixedsize = \"true\",\n      width = \"0.5\",\n      style = \"filled\",\n      fillcolor = \"aliceblue\",\n      color = \"gray70\",\n      fontcolor = \"gray50\"]\n\nedge [fontname = \"Helvetica\",\n     fontsize = \"8\",\n     len = \"1.5\",\n     color = \"gray80\",\n     arrowsize = \"0.5\"]\n\n  \"1\" [label = \"A\", fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"#000000\", pos = \"0,0!\"] \n  \"2\" [label = \"Y\", fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"#000000\", pos = \"1.8,0!\"] \n  \"3\" [label = \"L\", fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"#000000\", pos = \"0.9,0.4!\"] \n  \"4\" [label = <U<FONT POINT-SIZE=\"8\"><SUB>1<\/SUB><\/FONT>>, fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"#000000\", pos = \"0.45,0.9!\"] \n  \"5\" [label = <U<FONT POINT-SIZE=\"8\"><SUB>2<\/SUB><\/FONT>>, fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"#000000\", pos = \"1.35,0.9!\"] \n\"4\"->\"1\" [color = \"black\"] \n\"4\"->\"3\" [color = \"black\"] \n\"5\"->\"3\" [color = \"black\"] \n\"5\"->\"2\" [color = \"black\"] \n\"1\"->\"2\" [color = \"black\"] \n}","config":{"engine":"dot","options":null}},"evals":[],"jsHooks":[]}</script>

</center>

- `$A$` and `$Y$` are exchangeable unconditionally, `$Y(a) \ci A$`, ( `$H$` is the emptyset).

- Conditioning on `$L$` induces bias. (M-Bias), `$Y(a) \nci A \mid L$`.

- If we condition on `$L$`, we need to also condition on `$U_1$` or `$U_2$` to block the now open path.

---

## Examples

Is there a set of variables, `$H$` such that  `$Y(a) \ci A \mid H$`? What if `$U_1$` and `$U_2$` are unobserved?

<div class="grViz html-widget html-fill-item" id="htmlwidget-c99e66562640b9f5d185" style="width:60%;height:216px;"></div>
<script type="application/json" data-for="htmlwidget-c99e66562640b9f5d185">{"x":{"diagram":"digraph {\n\ngraph [layout = \"neato\",\n       outputorder = \"edgesfirst\",\n       bgcolor = \"white\"]\n\nnode [fontname = \"Helvetica\",\n      fontsize = \"10\",\n      shape = \"circle\",\n      fixedsize = \"true\",\n      width = \"0.5\",\n      style = \"filled\",\n      fillcolor = \"aliceblue\",\n      color = \"gray70\",\n      fontcolor = \"gray50\"]\n\nedge [fontname = \"Helvetica\",\n     fontsize = \"8\",\n     len = \"1.5\",\n     color = \"gray80\",\n     arrowsize = \"0.5\"]\n\n  \"1\" [label = \"A\", fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"#000000\", pos = \"0,0!\"] \n  \"2\" [label = \"Y\", fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"#000000\", pos = \"1.8,0!\"] \n  \"3\" [label = \"L\", fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"#000000\", pos = \"0.9,0.4!\"] \n  \"4\" [label = <U<FONT POINT-SIZE=\"8\"><SUB>1<\/SUB><\/FONT>>, fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"#000000\", pos = \"0.45,0.9!\"] \n  \"5\" [label = <U<FONT POINT-SIZE=\"8\"><SUB>2<\/SUB><\/FONT>>, fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"#000000\", pos = \"1.35,0.9!\"] \n\"4\"->\"1\" [color = \"black\"] \n\"4\"->\"3\" [color = \"black\"] \n\"5\"->\"3\" [color = \"black\"] \n\"5\"->\"2\" [color = \"black\"] \n\"1\"->\"2\" [color = \"black\"] \n\"3\"->\"2\" [color = \"black\"] \n}","config":{"engine":"dot","options":null}},"evals":[],"jsHooks":[]}</script>

</center>

-  `$\lbrace U_1, L\rbrace$` and `$\lbrace U_2, L \rbrace$` are both sufficient adjustment sets.

- If `$U_1$` and `$U_2$` are unobserved, there is no available set of variables we can condition on to remove bias.

---

## Examples

Is there a set of variables, `$H$` such that  `$Y(a) \ci A \mid H$`? What if `$U$` is unobserved?

<center>
<div class="grViz html-widget html-fill-item" id="htmlwidget-f5c5d0dff0182759d0f0" style="width:60%;height:216px;"></div>
<script type="application/json" data-for="htmlwidget-f5c5d0dff0182759d0f0">{"x":{"diagram":"digraph {\n\ngraph [layout = \"neato\",\n       outputorder = \"edgesfirst\",\n       bgcolor = \"white\"]\n\nnode [fontname = \"Helvetica\",\n      fontsize = \"10\",\n      shape = \"circle\",\n      fixedsize = \"true\",\n      width = \"0.5\",\n      style = \"filled\",\n      fillcolor = \"aliceblue\",\n      color = \"gray70\",\n      fontcolor = \"gray50\"]\n\nedge [fontname = \"Helvetica\",\n     fontsize = \"8\",\n     len = \"1.5\",\n     color = \"gray80\",\n     arrowsize = \"0.5\"]\n\n  \"1\" [label = \"A\", fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"#000000\", pos = \"0,0!\"] \n  \"2\" [label = \"Y\", fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"#000000\", pos = \"1.8,0!\"] \n  \"3\" [label = \"L\", fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"#000000\", pos = \"0.9,0!\"] \n  \"4\" [label = \"U\", fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"#000000\", pos = \"0.45,0.5!\"] \n\"1\"->\"3\" [color = \"black\"] \n\"3\"->\"2\" [color = \"black\"] \n\"4\"->\"1\" [color = \"black\"] \n\"4\"->\"3\" [color = \"black\"] \n}","config":{"engine":"dot","options":null}},"evals":[],"jsHooks":[]}</script>

</center>

-  Conditioning on `$U$` blocks all backdoor paths.

- Could we condition on `$L$` instead?

- `$L$` is a descendant of `$A$`, so it does not satisfy the backdoor criterion. It is not true that `$Y(a) \ci A \vert L$`.

---
# 3.3. Single World Intervention Graphs

---

## Single World Intervention Graphs (SWIGs)

- SWIGs are a method of including counterfactuals in DAGs proposed by Richardson and Robins.

- This is an alternative to a strategy called graph surgery proposed by Judea Pearl.

- SWIGs make the intervention more explicit than graph surgery does.

- Using SWIGs, we can evaluate conditional exchangeability using only `$d$`-separation without having to 
manage the other aspects of the backdoor criterion.

---

## SWIGs

- To create a SWIG, the node representing the intervened on variable is split into two nodes. 
  - One node represents the natural (observable) value of of the variable.
  - The other represents the fixed value due to the intervention.

</center>

---

## Templates (SWITs)

- Each SWIG can represent only a single intervention, i.e. the world in which everyone receives treatment `$A = a$`.

- A template is a graph valued function `$x \rightarrow \mathcal{G}(x)$`.  The input is the value the intervened-on variable is set to, the output is a SWIG.
  
<center>

</center>
  
---

## Single World

+ `$Y(0)$` and `$Y(1)$` never appear on the same graph.

+ SWIGs cannot represent relationships between counterfactuals "across worlds" (i.e. `$Y(0)$` and `$Y(1)$` ).

+ From the SWIGs below, we can conclude `$Y(0) \ci X$` and `$Y(1) \ci X$` but not `$Y(1), Y(0) \ci X$` or `$Y(1) \ci Y(0)$`.

</center>

---

## SWIG Procedure

- Step 1: Split nodes

</center>

- Split every intervention node into
  - `$A$` the random component; what `$A$` would have been without intervention.
  - `$a$` a fixed component representing the intervention.

- Incoming arrows go into `$A$` and outgoing arrows go out of `$a$`.

---

## SWIG Procedure

- Step 2: Re-label downstream nodes as counterfactuals.

</center>
---

## d-Separation in SWIGs

- In a SWIG, any path containing an intervention node is blocked.

- This sounds like a new rule but really usn't.

- Intervention nodes are fixed at a value, so no information can propagate through them.

- Fixed nodes in any graph block a path.

- However, it is atypical to include fixed nodes in non-SWIG graphs, so this is an important rule to remember for SWIGs.

---

## d-Separation in SWIGs

- Under the NPSEM-IE assumptions, `$d$`-separation in the SWIG implies exchangeability.

- That is, if `$\mathcal{G}(a)$` is the SWIG of the intervention `$A = a$` and `$Y(a)$` and `$A$` are `$d$`-separated in `$\mathcal{G}(a)$`, then 
`$$Y(a) \ci A$$`

- We have a similar result for conditional exchangeability.

- If `$Y(a)$` and `$A$` are `$d$`-separated in `$\mathcal{G}(a)$` conditional on a set of nodes `$L$`, then 
`$$Y(a) \ci A \mid \ L$$`
- NPSEM-IE is stronger than necessary to achieve this result.

---
## Applying d-separation in SWIGs

</center>

- This is the M-Bias example.

- We can see that `$Y(a)$` is d-separated from `$A$` unconditionally.

- Conditional on `$Z$`, `$Y(a)$` is not d-separated from `$A$`.

---

## Independence Assumptions

- As we said before, the NPSEM model is not sufficient to conclude the causal Markov property. We need an additional assumption.

- FFRCISTG Assumption: Let `$\mathbf{v}^\dagger$` be an intervention on every variable in `$V$`. The FFRCISTG independence assumption says that all of the counterfactual variables `$\lbrace V(\mathbf{v}^\dagger) \rbrace$` are mutually independent after this intervention.

- NPSEM-IE assumption: The NPSEM-IE assumption says that all of the errors `$\epsilon_V$` are independent.

- NPSEM-IE is strictly stronger than FFRCISTG.

- NPSEM-IE implies cross-world independences while FFRCISTG does not.

- We will generally always assume FFRCISTG.

---

## FFRCISTG vs NPSEM-IE
<center>

</center>

- FFRCISTG says $$ Z\ci M(z) \ci Y(z, m)$$ for any `$z$` and `$m$`.

- NPSEM-IE says `$$Z \ci \lbrace M(z) \text{ for all } z \rbrace \ci \lbrace Y(z, m)\ \text{for all } z, m \rbrace$$`

- For example, `$M(Z = 0) \ci Y(Z = 1, M = 0)$`.

---

## Factorization Result

- An important result is that the FFRCISTG assumption is sufficient to conclude that if  `$P(\mathbf{V})$` factorizes according to `$\mathcal{G}$` then  `$P(\mathbf{V}(\mathbf{a}))$` factorizes according to the SWIG `$\mathcal{G}(\mathbf{a})$`.

- This result is saying that the SWIG is an accurate representation of the world of intervention `$\mathbf{a}$`.

- The proof of this result works by reverse induction.

- We won't prove this in class but there is a nice illustration in Appendix B1 of Richardson and Robins (2013).
  
---

## SWIGs, Exchangeability, and d-Separation

- The factorization result allows us to conclude that d-separation in the SWIG implies conditional exchangeability. 
  - Recall: Conditional exchangeability says that `$Y(a) \ci A \mid L$`.
  - Since `$Y(a)$` is a node in the SWIG, we can now just read conditional exchangeability off of the graph using d-separation.

- Achieving `$d$`-separation in the SWIG is equivalent to satisfying the backdoor criterion.

- So factorization result also implies the FFRCISTG assumptions are sufficient for the backdoor backdoor criterion theorem to hold.

- Since NPSEM-IE is strictly stronger than FFRCISTG, NPSEM-IE assumptions also imply that the backdoor criterion theorem holds.

---

## Descendents of `$A$`

+ Using SWIGs helps show why the backdoor criterion excludes descendants of `$A$`.

</center>

+ From the SWIG on the right we can conclude that `$Y(x) \ci X \mid L_1, L_2(x)$`.

+ However, we cannot conclude that `$Y(x) \ci X \mid L_1, L_2$` because `$L_2$` is not on the graph.

+ In fact, the second statement is false. Conditioning on `$L_2$` introduces a type of selection bias (more on this in L4).

---

## More Examples: Confounder

</center>

- `$Y(m)$` is not unconditionally independent of `$M$` but, `$Y(m) \ci M \mid Z$`

- The factorization result gives us
`$$P(Z, M, Y(m)) = P(Z)P(M \vert Z)P(Y(m) \vert Z)$$`

- Note that we left the fixed node out of the probability calculation.

---

## Mediator

</center>

- Here we do have unconditional exchangeability, `$Y(z) \ci Z$`.

- From factorization, `$P(Z, M(z), Y(z)) = P(Z)P(M(z))P(Y(z) \vert M(z))$`

---

## Mediator with Two Interventions

</center>

- From this graph we can get `$Y(z, m) \ci M(z)$`.

- We have to use the new rule that fixed nodes block paths.

- This is saying that intervening on `$M$` blocks the effect of `$Z$` that is propogated through `$M(z)$`.

- From factorization, `$P(Z, M(z), Y(z)) = P(Z)P(M(z))P(Y(z))$`

---

## Mediation Effects

<div class="grViz html-widget html-fill-item" id="htmlwidget-31e62a0195631de10256" style="width:40%;height:144px;"></div>
<script type="application/json" data-for="htmlwidget-31e62a0195631de10256">{"x":{"diagram":"digraph {\n\ngraph [layout = \"neato\",\n       outputorder = \"edgesfirst\",\n       bgcolor = \"white\"]\n\nnode [fontname = \"Helvetica\",\n      fontsize = \"10\",\n      shape = \"circle\",\n      fixedsize = \"true\",\n      width = \"0.5\",\n      style = \"filled\",\n      fillcolor = \"aliceblue\",\n      color = \"gray70\",\n      fontcolor = \"gray50\"]\n\nedge [fontname = \"Helvetica\",\n     fontsize = \"8\",\n     len = \"1.5\",\n     color = \"gray80\",\n     arrowsize = \"0.5\"]\n\n  \"1\" [label = \"A\", fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"#000000\", pos = \"-0.1,0.5!\"] \n  \"2\" [label = \"Y\", fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"#000000\", pos = \"1,0!\"] \n  \"3\" [label = \"L\", fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"#000000\", pos = \"0.2,0!\"] \n\"1\"->\"2\" [color = \"black\"] \n\"1\"->\"3\" [color = \"black\"] \n\"3\"->\"2\" [color = \"black\"] \n}","config":{"engine":"dot","options":null}},"evals":[],"jsHooks":[]}</script>
</center>

- In the graph above, `$L$` is *mediating* part of the effect of `$A$` on `$Y$`.

- Using our machinery so far, we can define the total effect (TE) of `$A$` on `$Y$` as
`$$E[Y(A = 1)] - E[Y(A = 0)]$$`
- We might be interested in the effect of `$A$` that is not mediated through `$L$`.

- This would be the effect of `$A$` on `$Y$` if we intervened on `$L$` and prevented `$L$` from changing. 
  - For example, we intervene on `$A$` and set it to 1. 
  - But we intervene on `$L$` and set it to `$L(0)$`. 
---

## Natural Direct and Indirect Effect

<div class="grViz html-widget html-fill-item" id="htmlwidget-295bacb1831409d37deb" style="width:40%;height:144px;"></div>
<script type="application/json" data-for="htmlwidget-295bacb1831409d37deb">{"x":{"diagram":"digraph {\n\ngraph [layout = \"neato\",\n       outputorder = \"edgesfirst\",\n       bgcolor = \"white\"]\n\nnode [fontname = \"Helvetica\",\n      fontsize = \"10\",\n      shape = \"circle\",\n      fixedsize = \"true\",\n      width = \"0.5\",\n      style = \"filled\",\n      fillcolor = \"aliceblue\",\n      color = \"gray70\",\n      fontcolor = \"gray50\"]\n\nedge [fontname = \"Helvetica\",\n     fontsize = \"8\",\n     len = \"1.5\",\n     color = \"gray80\",\n     arrowsize = \"0.5\"]\n\n  \"1\" [label = \"A\", fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"#000000\", pos = \"-0.1,0.5!\"] \n  \"2\" [label = \"Y\", fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"#000000\", pos = \"1,0!\"] \n  \"3\" [label = \"L\", fontname = \"Helvetica\", fontsize = \"10\", width = \"0.3\", fillcolor = \"#FFFFFF\", color = \"black\", fontcolor = \"#000000\", pos = \"0.2,0!\"] \n\"1\"->\"2\" [color = \"black\"] \n\"1\"->\"3\" [color = \"black\"] \n\"3\"->\"2\" [color = \"black\"] \n}","config":{"engine":"dot","options":null}},"evals":[],"jsHooks":[]}</script>
</center>
  
- The *natural direct effect*(NDE) is
`$$E[Y(A = 1, L = L(0))] - E[Y(A = 0, L = L(0)]$$`
- The *natural indirect effect*(NIE) is

$$
E[Y(A = 1, L= L(1))] - E[Y(A = 1, L = L(0))]
$$
- So TE = NDE + NIE

- Note that both of these involve "cross-world" counterfactuals.

---

## Identifying NIE and NDE

- The FFRCISTG independence assumption does not allow identification of the NIE and NDE.

- See Robins and Greenland (1992) for a good example.

- If we further assume the NPSEM-IE model, NIE and NDE are identifiable.

- We will come back to this in the future to talk more about mediation.