• Graduate program
    • Why Tinbergen Institute?
    • Research Master
    • Admissions
    • Course Registration
    • Facilities
    • PhD Vacancies
    • Selected PhD Placements
    • Research Master Business Data Science
  • Research
  • Browse our Courses
  • Events
    • Summer School
      • Applied Public Policy Evaluation
      • Deep Learning
      • Economics of Blockchain and Digital Currencies
      • Economics of Climate Change
      • Foundations of Machine Learning with Applications in Python
      • From Preference to Choice: The Economic Theory of Decision-Making
      • Gender in Society
      • Machine Learning for Business
      • Marketing Research with Purpose
      • Sustainable Finance
      • Tuition Fees and Payment
      • Business Data Science Summer School Program
    • Events Calendar
    • Events Archive
    • Tinbergen Institute Lectures
    • 16th Tinbergen Institute Annual Conference
    • Annual Tinbergen Institute Conference
  • News
  • Alumni
Home | Events | Model Selection in Large Dimensional Linear Regression using Sequential Multiple Testing
Seminar

Model Selection in Large Dimensional Linear Regression using Sequential Multiple Testing


  • Location
    University of Amsterdam, Roeterseilandcampus, E5.07
    Amsterdam
  • Date and time

    September 19, 2025
    12:30 - 13:30

High dimensional regression specification and analysis is a complex and active area of research in statistics and econometrics. A large number of approaches has been proposed each with its own limitations and challenges. This paper proposes a new hybrid approach combining elements from two existing methods. The first is the greedy methodology developed by Chudik et al. (2018), where a powerful multiple testing step introduces parsimony, by ensuring that completely irrelevant variables are not selected, with high probability. The second is stage-wise regression where relevant variables are selected in steps, rather than jointly as in Chudik et al. (2018). However, in that literature the stopping rules, which typically rely on model information criteria, are not sufficiently parsimonious. We derive some theoretical properties of the new method and show, through simulations, that it performs well. An illustration, using corporate emissions data, provides an empirical perspective.