VIM Causal Inference

Comments

Macartan Humphreys

Joohye Jeong

Stimuli Augmentation

Kudos

Sharp focus on balance/control in complex vignettes
Lots of thoughts about processes, complexities, risks
Nice illustrations and demonstration of effectiveness
Some very clever diagnoistics

Very hard to argue against this: sensible and scaleable

Causal ordering and entanglements

The thing that I have been wondering most about:

Does evidence of successful targeting have to be theory based not empirical? Or:
What implication for current diagnostics?
What implication for alternative strategies?

Writing

LLMs do not seem central here?
- Rather gain of algorithm is that these could be done with LLMs or not
- Describe core strategy in the abstract and demonstrate with LLMs
Also useful perhaps if there are different audiences for the formal features and the practical guidelines

So?

Theory paper
Guidance paper: general strategy plus LLM implementation (package?)

Weiss and Dursun [Ekin]

An Instrumental Variables Approach for More Robust Emotion Experiments

Kudos

Very thoughtful piece
Nice conceptualizations
Multiple implementations
Constructive approach for a hard problem
Multiple encouragements to capture both a type of specificity and sensitivity
- \(\Pr(\text{not activated} | \text{not targeted by } Z)\)
- \(\Pr(\text{activated} | \text{targeted by } Z)\)

Sensitivity and specificity: Current

Sensitivity and specificity: Possible?

For pairs anyhow…

Vignettes beat Images? Where would the indifference curves be here?

Which of these is going on?

Exclusion restriction concern

Just how it is

Or truly entangled?

Should we just focus on (real world) Zs? Is this an instruments problem or mediation problem?

Metrics

In what sense are anger and fear on the same metric?

Here we focus on changes normalized by variance But what is to say that is right?

Myla Burton: Null worlds

Myla Burton: Null Worlds

Kudos

the framework constructs theory-aligned alternatives that preserve the persistence and interdependence of political data while excluding the mechanism of interest

Direct shot at model dependency

Assess what you have to believe about the world to draw the inferences you do; in which worlds would you get lost

love theory focus
really nicely written
really nice interpretation and modest handling of different findings

Scope for synthesis?

Each null world is taken one at a time.
Can we combine?
What’s the probability of seeing data like this given no true effect and the following distribution over worlds?
Or can we nest? When should we nest? When can we not?
at least within “tractable set”

Ex ante or ex post

Also connects to one idea in DeclareDesign

Principle 3.2: a good design should work … even when the world is different from what we expect

Null models: An important example is the performance of a research design under a “null model,” where the true effect size is zero.

The DeclareDesign approach would do this ex ante, independent of realized data; you do ex post. What are the merits of each?

Fishing risks

Implications for replications
Scope for critics to fish null worlds

Writing

There is a general deep idea here and there is a specific application
The application is a little complicated (and you need to bring readers up to speed on the analysis strategy)
Can you either separate or else build out with abstract examples before this application

Lin, Liu: To Impute or Not? [Shiyao]

Kudos

Great question
Love the focus on theory and thinking structurally about this
Nice examples of when things are easy and when they are hard.

Insight

Key insight:

determining whether \(\tau_{CCA}\) is biased for \(\tau\) reduces to assessing whether including \(M_i\) as a control in the oracle regression biases the treatment estimation for \(\tau\) , i.e. whether \(M_i\) “over-controls”.

Insight

Let \(M\) denote mssingness. Key insight:

Controlling for \(M\) is overcontrolling. It does not open a backdoor path, but it does violate the backdoor criterion, being post-treatment.

What’s missing? \(X\) or \(Y\)? If \(X\) it is MAR. If \(Y\) it is MNAR. But the data look the same.

Lin, Liu: Illustration

df <- fabricate(N = 10000,
                 D = complete_ra(N),
                 Y = simple_ra(N, prob_unit =.25 + .5 * D),
                 M = simple_ra(N, prob_unit = .8*Y))

Oracle:

term	estimate	std.error	statistic	df
(Intercept)	0.06	0.00	16.57	9996
D	0.29	0.01	25.76	9996
M	0.94	0.00	239.90	9996
D:M	-0.29	0.01	-25.76	9996

Conditioned:

term	estimate	std.error	statistic	p.value	df
(Intercept)	0.06	0.00	16.57	0	5983
D	0.29	0.01	25.76	0	5983

Lin, Liu: Omnibus missingness node

Say you do not control for \(U\)

Then no problem here?

But here it looks like a problem?

Moderators and missingness

Say:

\(W\) moderates the effect of \(X\) on \(Y\). Say \(M=1\) when \(W = 1\), then we only observe the effect of \(X\) on \(Y\) when \(W = 0\).
But controlling for M is not “overcontrolling”?
The coefficient changes but it would not with a demeaned treatment. This is a support question: estimand changes.

Moderation

df <- fabricate(N = 10000, D = complete_ra(N),
                W = complete_ra(N), Y = D*W, M = W==1)

Ideal:

term	estimate	std.error	p.value	df
(Intercept)	0.0	0.00	0	9998
D	0.5	0.01	0	9998

CATE (\(M = W =0\)), correctly recovered from missing data analysis:

term	estimate	df
(Intercept)	0	9996
D	0	9996
MTRUE	0	9996
D:MTRUE	1	9996

Build out connections

Bareinboim and Pearl (2016) Bareinboim and Pearl (2012), Bareinboim, Tian, and Pearl (2014), Mohan and Pearl (2021), Saadati and Tian (2019)

Bareinboim, Elias, and Judea Pearl. 2012. “Controlling Selection Bias in Causal Inference.” In Proceedings of the Fifteenth International Conference on Artificial Intelligence and Statistics (AISTATS), 100–108. https://proceedings.mlr.press/v22/bareinboim12/bareinboim12.pdf.

———. 2016. “Causal Inference and the Data-Fusion Problem.” Proceedings of the National Academy of Sciences 113 (27): 7345–52.

Bareinboim, Elias, Jin Tian, and Judea Pearl. 2014. “Recovering from Selection Bias in Causal and Statistical Inference.” In Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence. https://ftp.cs.ucla.edu/pub/stat_ser/r425.pdf.

Mohan, Karthika, and Judea Pearl. 2021. “Graphical Models for Processing Missing Data.” Journal of the American Statistical Association 116 (534): 1023–37. https://www.tandfonline.com/doi/full/10.1080/01621459.2021.1874961.

Saadati, Mojdeh, and Jin Tian. 2019. “Adjustment Criteria for Recovering Causal Effects from Missing Data.” arXiv Preprint arXiv:1907.01654. https://arxiv.org/abs/1907.01654.