Random effects misspecification can have severe consequences for random effects inference in linear mixed models

Francis K. C. Hui*, Samuel Müller, Alan H. Welsh

*Corresponding author for this work

Research output: Contribution to journal › Article

Abstract

There has been considerable and controversial research over the past two decades into how successfully random effects misspecification in mixed models (i.e. assuming normality for the random effects when the true distribution is non‐normal) can be diagnosed and what its impacts are on estimation and inference. However, much of this research has focused on fixed effects inference in generalised linear mixed models. In this article, motivated by the increasing number of applications of mixed models where interest is in the variance components, we study the effects of random effects misspecification on random effects inference in linear mixed models, for which there is considerably less literature. Our findings are surprising and contrary to general belief: for point estimation, maximum likelihood estimation of the variance components under misspecification is consistent, although in finite samples both the bias and mean squared error can be substantial. For inference, we show through theory and simulation that under misspecification, standard likelihood ratio tests of truly non‐zero variance components can suffer from severely inflated type I errors, and confidence intervals for the variance components can exhibit considerable undercoverage. Furthermore, neither of these problems vanishes asymptotically as the number of clusters or the cluster size increases. These results have major implications for random effects inference, especially if the true random effects distribution is heavier tailed than the normal. Fortunately, simple graphical and goodness‐of‐fit measures of the random effects predictions appear to have reasonable power to detect misspecification. We apply linear mixed models to a survey of more than 4000 high school students within 100 schools and analyse how mathematics achievement scores vary with student attributes and across different schools. The application demonstrates the sensitivity of mixed model inference to the true but unknown random effects distribution.
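The undercoverage claim can be illustrated with a small Monte Carlo sketch. This is not the authors' simulation design: it assumes a balanced random intercept model, uses a normal-theory Wald interval in place of the likelihood ratio procedures studied in the article, and every setting (200 clusters of size 5, unit variances, a t distribution with 5 degrees of freedom, and the function names) is a hypothetical choice made for illustration only.

```python
import numpy as np


def draw_normal(rng, m):
    """Unit-variance normal random effects (correctly specified case)."""
    return rng.standard_normal(m)


def draw_t5(rng, m):
    """Heavy-tailed random effects: t with 5 df, rescaled to unit variance."""
    return rng.standard_t(5, size=m) / np.sqrt(5.0 / 3.0)


def wald_coverage(draw_b, m=200, n=5, sigma2_b=1.0, sigma2_e=1.0,
                  reps=2000, seed=0):
    """Empirical coverage of a nominal 95% normal-theory Wald interval
    for the random intercept variance in a balanced one-way model."""
    rng = np.random.default_rng(seed)
    hits = 0
    for _ in range(reps):
        b = np.sqrt(sigma2_b) * draw_b(rng, m)            # cluster effects
        e = rng.normal(0.0, np.sqrt(sigma2_e), size=(m, n))
        y = b[:, None] + e                                # intercept set to 0
        cluster_means = y.mean(axis=1)
        ssw = ((y - cluster_means[:, None]) ** 2).sum()   # within-cluster SS
        ssb = n * ((cluster_means - y.mean()) ** 2).sum() # between-cluster SS
        # Closed-form ML estimates for the balanced design.
        s2_e = ssw / (m * (n - 1))
        s2_b = max(0.0, (ssb / m - s2_e) / n)
        # Asymptotic variance of s2_b derived under normal random effects.
        var_hat = (2.0 / n**2) * ((n * s2_b + s2_e) ** 2 / m
                                  + s2_e**2 / (m * (n - 1)))
        half = 1.96 * np.sqrt(var_hat)
        hits += int(s2_b - half <= sigma2_b <= s2_b + half)
    return hits / reps


if __name__ == "__main__":
    print("normal random effects:", wald_coverage(draw_normal))
    print("t5 random effects:    ", wald_coverage(draw_t5))
```

Under these assumptions the interval's standard error ignores the excess kurtosis of the random effects, so coverage in the heavy-tailed case should fall visibly below the normal case, and increasing `m` would not be expected to close the gap.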

Original language: English
Journal: International Statistical Review
Early online date: 15 Apr 2020
Publication status: E-pub ahead of print, 15 Apr 2020
Externally published: Yes

Keywords

  • fixed effects
  • hypothesis testing
  • maximum likelihood
  • prediction
  • robustness
  • variance components
