◆ THE BLUEPRINT
Simpson's Paradox
A trend that appears in aggregated data can reverse when the data is split into groups. This is not rare. It happens whenever a confounding variable creates different subpopulations.
Why It Matters
If your data has group structure and you ignore it, your model can give you the wrong sign on a coefficient. The overall slope and the within-group slopes can point in opposite directions.
The Fix
Account for the group structure in your model. The next tabs explore three ways to do this: ignore groups entirely (complete pooling), treat each group independently (no pooling), or model groups as drawn from a distribution (partial pooling).