Random effects misspecification can have severe consequences for random effects inference in linear mixed models

Francis K. C. Hui*, Samuel Müller, Alan H. Welsh

*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

8 Citations (Scopus)

Abstract

There has been considerable and controversial research over the past two decades into how successfully random effects misspecification in mixed models (i.e. assuming normality for the random effects when the true distribution is non‐normal) can be diagnosed and what its impacts are on estimation and inference. However, much of this research has focused on fixed effects inference in generalised linear mixed models. In this article, motivated by the increasing number of applications of mixed models where interest is in the variance components, we study the effects of random effects misspecification on random effects inference in linear mixed models, for which there is considerably less literature. Our findings are surprising and contrary to general belief: for point estimation, maximum likelihood estimation of the variance components under misspecification is consistent, although in finite samples, both the bias and mean squared error can be substantial. For inference, we show through theory and simulation that under misspecification, standard likelihood ratio tests of truly non‐zero variance components can suffer from severely inflated type I errors, and confidence intervals for the variance components can exhibit considerable undercoverage. Furthermore, neither of these problems vanishes asymptotically as the number of clusters or the cluster size increases. These results have major implications for random effects inference, especially if the true random effects distribution is heavier tailed than the normal. Fortunately, simple graphical and goodness‐of‐fit measures of the random effects predictions appear to have reasonable power to detect misspecification. We apply linear mixed models to a survey of more than 4 000 high school students within 100 schools and analyse how mathematics achievement scores vary with student attributes and across different schools. The application demonstrates the sensitivity of mixed model inference to the true but unknown random effects distribution.
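
The undercoverage claim can be illustrated with a small simulation. The sketch below is not the authors' simulation design; it is a minimal illustration under assumptions chosen here: a balanced random intercept model with zero mean, closed-form maximum likelihood estimates of the variance components, a normal-theory Wald interval for the random intercept variance, and random effects drawn either from a normal distribution or from a t distribution with 5 degrees of freedom rescaled to unit variance. The function names (`wald_coverage`, `normal_draws`, `t5_draws`) and all settings (200 clusters of size 5, 2000 replicates) are hypothetical.

```python
import numpy as np

def wald_coverage(draw_b, m=200, n=5, sigma2_b=1.0, sigma2_e=1.0,
                  reps=2000, seed=0):
    """Empirical coverage of a nominal 95% normal-theory Wald interval
    for the random intercept variance in y_ij = b_i + e_ij."""
    rng = np.random.default_rng(seed)
    # draw_b returns unit-variance random effects; rescale to sigma2_b.
    b = np.sqrt(sigma2_b) * draw_b(rng, (reps, m))
    e = rng.normal(0.0, np.sqrt(sigma2_e), (reps, m, n))
    y = b[:, :, None] + e  # shape (reps, clusters, cluster size)

    cluster_means = y.mean(axis=2)
    grand_mean = cluster_means.mean(axis=1, keepdims=True)
    ssb = n * ((cluster_means - grand_mean) ** 2).sum(axis=1)
    ssw = ((y - cluster_means[:, :, None]) ** 2).sum(axis=(1, 2))

    # Closed-form ML estimates for the balanced one-way layout.
    s2e_hat = ssw / (m * (n - 1))
    s2b_hat = np.maximum(0.0, (ssb / m - s2e_hat) / n)

    # Plug-in standard error that is valid when b_i is normal.
    se = np.sqrt(2.0 / n**2 * ((n * s2b_hat + s2e_hat) ** 2 / m
                               + s2e_hat**2 / (m * (n - 1))))
    covered = np.abs(s2b_hat - sigma2_b) <= 1.96 * se
    return covered.mean()

def normal_draws(rng, size):
    return rng.standard_normal(size)

def t5_draws(rng, size, df=5):
    # A t variate with df > 2 has variance df / (df - 2); rescale to 1.
    return rng.standard_t(df, size) / np.sqrt(df / (df - 2))

coverage_normal = wald_coverage(normal_draws)
coverage_t5 = wald_coverage(t5_draws)
print(f"normal random effects: {coverage_normal:.3f}")
print(f"t5 random effects:     {coverage_t5:.3f}")
```

The standard error used here depends only on the variance components, whereas the sampling variability of the estimated random intercept variance also depends on the kurtosis of the random effects, so the interval tends to be too narrow when the true distribution is heavier tailed than the normal.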

Original language: English
Pages (from-to): 186-206
Number of pages: 21
Journal: International Statistical Review
Volume: 89
Issue number: 1
Early online date: 15 Apr 2020
Publication status: Published - Apr 2021
Externally published: Yes

Keywords

  • Fixed effects
  • hypothesis testing
  • maximum likelihood
  • prediction
  • robustness
  • variance components
