So much good news!
My comments mostly marginal
Identification uses fact that:
“villages whose population exceeds one thousand should be allocated into their own GP” 1995/ 2015
Two strategies:
For leavers:
\[\mathbb E_x\mathbb E_{i\in A_x}[y_{i}(x) - y_i(1000)]\] where \(A_x\) is the set of villages with population around 1000 in a GP of size \(x\).
For remainers
\[\mathbb E_x\mathbb E_{i\in B_x}[y_i(x) - y_i(x-1000)]\]
where \(B_x\) is the set of villages with neighbor village around 1000 in a GP of size \(x\).
Note:
Average effect of dropping by 1000 seems more interpretable than effect of dropping to 1000. More like a derivative!
Even with this cleverness:
How comparable are the average numbers between strategy 1 and 2? Quite different estimands.
How interpretible are differences between strata? these are not as-if random
Substantive 1: How much of this could be novelty effects?
Substantive 2: Are there fixed costs per unit? Are smaller units more expensive with very diffuse costs? Interpretation if this is a transfer
Clarifying this might help with interpreting interactions:
Interaction inferences: Consider the difference between
For the splitter B is a bigger shock than A
For the remainer A is a bigger shock than B
Inquiry:
And
Did you consider pre-registering
Rumps could be so small?
Comment strategy 2: Is taking size of largest the most efficient? Multiple leavers? Predicable population growth: can you use the prediction around 1000.
Tables and Figures in text please!
BUt: