Genuinely Robust Inference for Clustered Data

Series

Econometrics Seminars and Workshop Series
Speaker(s)

Yulong Wang (Syracuse University, United States)
Field

Econometrics, Data Science and Econometrics

Location

University of Amsterdam, room E5.07
Amsterdam
Date and time

March 14, 2025
13:00 - 14:00

Abstract

Conventional methods for cluster-robust inference are inconsistent when clusters of unignorably large size are present. We formalize this issue by deriving a necessary and sufficient condition for consistency, a condition frequently violated in empirical studies. Specifically, 77% of empirical research articles published in American Economic Review and Econometrica during 2020–2021 do not satisfy this condition. To address this limitation, we propose two alternative approaches: (i) score subsampling and (ii) size-adjusted reweighting. Both methods ensure uniform size control across broad classes of data-generating processes where conventional methods fail. The first approach (i) has the advantage of ensuring robustness while retaining the original estimator. The second approach (ii) modifies the estimator but is readily implementable by practitioners using statistical software such as Stata and remains uniformly valid even when the cluster size distribution follows Zipf’s law. Extensive simulation studies support our findings, demonstrating the reliability and effectiveness of the proposed approaches.

Link to Paper: https://arxiv.org/abs/2308.10138