• Graduate program
    • Why Tinbergen Institute?
    • Research Master
    • Admissions
    • Course Registration
    • Facilities
    • PhD Vacancies
    • Selected PhD Placements
    • Research Master Business Data Science
  • Research
  • Browse our Courses
  • Summer School
  • Events
    • Summer School
      • Applied Public Policy Evaluation
      • Deep Learning
      • Economics of Blockchain and Digital Currencies
      • Economics of Climate Change
      • Foundations of Machine Learning with Applications in Python
      • From Preference to Choice: The Economic Theory of Decision-Making
      • Gender in Society
      • Machine Learning for Business
      • Marketing Research with Purpose
      • Sustainable Finance
      • Tuition Fees and Payment
      • Business Data Science Summer School Program
    • Events Calendar
    • Events Archive
    • Tinbergen Institute Lectures
    • 16th Tinbergen Institute Annual Conference
    • Annual Tinbergen Institute Conference
  • News
  • Alumni
Home | Events Archive | Genuinely Robust Inference for Clustered Data
Seminar

Genuinely Robust Inference for Clustered Data


  • Location
    University of Amsterdam, room E5.07
    Amsterdam
  • Date and time

    March 14, 2025
    13:00 - 14:00

Abstract

Conventional methods for cluster-robust inference are inconsistent when clusters of unignorably large size are present. We formalize this issue by deriving a necessary and sufficient condition for consistency, a condition frequently violated in empirical studies. Specifically, 77% of empirical research articles published in American Economic Review and Econometrica during 2020–2021 do not satisfy this condition. To address this limitation, we propose two alternative approaches: (i) score subsampling and (ii) size-adjusted reweighting. Both methods ensure uniform size control across broad classes of data-generating processes where conventional methods fail. The first approach (i) has the advantage of ensuring robustness while retaining the original estimator. The second approach (ii) modifies the estimator but is readily implementable by practitioners using statistical software such as Stata and remains uniformly valid even when the cluster size distribution follows Zipf’s law. Extensive simulation studies support our findings, demonstrating the reliability and effectiveness of the proposed approaches.

Link to Paper: https://arxiv.org/abs/2308.10138