from Epidemiological
Bulletin , Vol. 25 No. 1, march 2004
A Glossary for Multilevel Analysis - Part III
Ana V. Diez Roux
Divisions of Medicine and Epidemiology, Columbia University
New York, New York, United States
Non-independence of observations
Refers to situations in which dependent variables for observations at a lower
level nested within the same higher level unit (or cluster) are correlated,
even after measured characteristics are taken into account. For example,
two persons from the same neighborhood may tend to have more similar blood
pressure levels than two persons from different neighborhoods, even after
measured individual and neighborhood characteristics are taken into account.
In the case of repeat measures on individuals over time, two blood pressure
measurements on the same person may tend to be more similar than two measures
on different persons even after relevant covariates are taken into account.
One reason for this correlation may have to do with the omission of important
higher level variables that observations within the same higher level unit
share. This residual correlation violates the assumption of independence
of observations underlying usual regression approaches. Ignoring this correlation
may lead to incorrect inferences. Efficiency of estimation may also be reduced.40
Multilevel models account for potential residual correlation by modelling
intercepts and regression coefficients as random (for example, by allowing
for macro level errors, U0j and U1j in second level equations, see Multilevel
models).
Population-average models
Models that account for correlation between lower level units within higher
level units (or clusters) by modelling the correlations or covariances themselves
rather than by allowing for random effects or random coefficients as Multilevel
models do.40, 46 These correlations are taken into account in the estimation
of regression coefficients and their standard errors. Different correlation
structures (describing within cluster or within higher level unit correlations)
can be specified. “Population-average models” are also referred
to as “marginal models”40, 46 or “covariance pattern models”.26
Whereas multilevel models model the dependent variable conditional on the
random effects (or random coefficients), population-average models model
the marginal expectation of the dependent variables across the population
(in a sense, “averaged “ across the random effects). For this
reason, marginal models have also been called “population-average” models
(as a way to contrast them with subject specific random effects models).46
The Generalized Estimating Equation (GEE) approach is one approach to fitting
marginal models.46
Population-average models model the population-average response as a function
of covariates without explicitly accounting for heterogeneity across higher
level units.46 In contrast, multilevel models investigate and explain the source
of group to group variation (and of the within group correlation) by modelling
group specific regression coefficients as a function of group level variables
plus random variation. Therefore, although population-average models account
for the correlation between outcomes within higher level units, the source
of this correlation is not directly investigated (the correlation, and sometimes
higher level effects themselves, are viewed as nuisance parameters that must
be taken into account but are not of direct interest). Therefore, population
average models do not allow examination of group to group variation, of the
group level or individual level variables related to it, or of the degree of
variation present between and within groups, as multilevel models do (see variance
components). Differences between both types of models also have consequences
for the interpretation of regression coefficients: in the multilevel model,
the regression coefficient estimates how the response changes as a function
of covariates conditional on the random effects; in the marginal model, the
coefficient expresses how the response changes as a function of covariates “averaged” over
group to group heterogeneity (or group random effects).40, 46 In the case of
continuous dependent variables these coefficients are mathematically equivalent,
but in the case of non-normally distributed variables (for example, logistic
models) the marginal parameter values will usually be smaller in absolute value
than their random effects analogues.46, 47
Psychologistic fallacy
An inferential fallacy that may arise from the failure to consider group characteristics
in drawing inferences regarding the causes of variability across individuals1,
2—that is, assuming that individual level outcomes can be explained
exclusively in terms of individual level characteristics. Although the level
at which data are collected may fit the conceptual model being investigated
(that is, individual level), important facts pertaining to other levels (that
is, group level) may have been ignored.1, 2 For example, a study based on
individuals might find that immigrants are more likely to develop depression
than natives. But suppose this is only true for immigrants living in communities
where they are a small minority. A researcher ignoring the contextual effect
of community composition might attribute the higher overall rate in immigrants
to the psychological effects of immigration or to genetic factors, ignoring
the importance of community level factors and thus committing the psychologistic
fallacy.1 The term “psychologistic fallacy” is not entirely appropriate
because the individual level factors used to explain the outcome are not
always exclusively psychological.2 Although the term “individualistic
fallacy”may appear more adequate, it has also been used as a synonym
for the related but distinct atomistic fallacy.3, 4 See also sociologistic
fallacy.
Random coefficient models
Term originally used for models in which the regression coefficients corresponding
to covariates in the model are treated as random rather than fixed19, 26
(that is, models containing random coefficients, see for example b1j in the
entry for multilevel models). Traditional random coefficient models do not
include higher level (or group level) predictors in the group level equations
for the covariate effects (that is, in a traditional random coefficient model,
equation (3) would be b1j = 10 + U1j).19 Thus random coefficient models can
be thought of as a particular case of the more general multilevel models.
However, the term random coefficient models is sometimes used more broadly
to refer to multilevel models generally. See also random effects models.
Random effects/random coefficients
Regression coefficients (intercepts or covariate effects) that are allowed
to vary randomly across higher level units (that is, are assumed to be realizations
of values from a probability distribution) (see multilevel models). For example,
in the case of persons nested within neighborhoods, neighborhood effects
can be assumed to vary randomly around an overall mean (random effect, see
random effects models). Similarly, the effect of personal income on individual
health may be allowed to vary randomly across neighborhoods (random coefficient,
see random coefficient models). Although the terms “random effects” and “random
coefficients” are sometimes distinguished as noted above, they are
often used interchangeably. The use of random effects or random coefficients
is especially appropriate when the higher level units (or groups) can be
thought of as random samples from a larger population of units (or groups)
about which inferences wish to be made. See also fixed effects/fixed coefficients.
Random effects models
Term originally used for models in which differences across groups (or other
classification system) are treated as random rather than fixed19, 26 (that
is, models containing random effects). For example, in the case involving
individuals nested within neighborhoods, a model treating neighborhood differences
as fixed would include all neighborhoods represented in the sample as a set
of dummy variables in a regression equation with individuals as the units
of analysis (see fixed coefficients). In contrast, a random effects model
would treat neighborhood differences as realizations from a probability distribution—that
is, neighborhood intercepts would be allowed to vary randomly across neighborhoods
following a probability distribution (see multilevel models). An underlying
assumption is that the neighborhoods in the study are a random sample from
a larger population of neighborhoods about which inferences wish to be made.
Random effects models can be thought of as a particular case of the more
general multilevel models in which only intercepts are allowed to vary randomly
across groups (that is, random intercept models). Sometimes, however, the
term random effects models is used more broadly to refer to multilevel models
generally (that is, models that allow for both random intercept and random
covariate effects). See also random coefficient models.
Residual correlation
See non-independence of observations.
Sociologistic fallacy
An inferential fallacy that may arise from the failure to consider individual
level characteristics in drawing inferences regarding the causes of variability
across groups.1, 2 Although the level at which data are collected may fit
the conceptual model being investigated (that is, group level), important
facts pertaining to other levels (that is, the individual level) may have
been ignored.1 Suppose a researcher finds that communities with higher rates
of transient population have higher rates of schizophrenia, and he/she concludes
that higher rates of transient population lead to social disorganization,
breakdown of social networks, and increased risk of schizophrenia among all
community inhabitants. But suppose that schizophrenia rates are only increased
for transient residents (because transient residents tend to have fewer social
ties, and individuals with few social ties are at greater risk of developing
schizophrenia). That is, rates of schizophrenia are high for transient residents
and low for non-transient residents, regardless of whether they live in communities
with a high or a low proportion of transient residents. If this is the case,
the researcher would be committing the sociologistic fallacy in attributing
the higher schizophrenia rates to social disorganization affecting all community
members rather than to differences across communities in the percentage of
transient residents. See also psychologistic fallacy.
Structural variables
A type of group level variable that refers to relations or interactions between
members of a group,13 for example, characteristics of social networks within
the group or patterns of contacts or interactions between members of the
group. Structural variables are sometimes considered a subtype of integral
variables.12, 18
Subject specific models
Term used to refer to random effects/random coefficient models (or multilevel
models generally) in order to contrast them with population-average models. “Subject
specific” is used because the term was originally developed in the
context of longitudinal data analysis,46 where individuals or subjects are
the higher level units and repeat measures are the lower level units. In
this case, the fixed effects coefficients derived from a random effect, random
coefficient, or multilevel model are conditional on person level (or person
specific) random effects, hence the term “subject specific”.
More generally, they can be thought of as “higher level unit” specific
(or cluster specific), because they are conditional or higher level unit
(or cluster specific) random effects. For example, in the entry for multilevel
models, the estimate of 01 is conditional on group level random effects (as
reflected by the presence of Uoj and U1j).
Variance components
Using multilevel models the total variance in individual level outcomes (or
lower level outcomes generally) can be decomposed into variance within and
between groups (or higher level units generally). For example, the variance
in blood pressure across individuals can be decomposed into variance within
and between neighborhoods. These components are referred to as variance components.
The ability to estimate the variance components (which provide important
information on the variability in the outcome between and within groups)
is a key feature of multilevel models, and what distinguishes multilevel
models from traditional contextual effects models and population-average
models. For this reason, multilevel models have also sometimes been referred
to as variance component or covariance component models. See also multilevel
models.
References:
NOTE: References 1-38 were included in Part I and II of the Glossary, in Vol.
24, No. 3 (2003) and Vol. 24, No. 4 (2003) of the Epidemiological Bulletin.
(39) Wong G, Mason W. The hierarchical logistic regression model for multilevel
analysis. J Am Stat Assoc 1985;80:513–24.
(40) Diggle PJ, Liang KY, Zeger SL. Analysis of longitudinal data. New York:Oxford
University Press, 1994.
(41) Laird NM, Ware H. Random effects models for longitudinal data. Biometrics
1982;38:963–74.
(42) Longford NT. Random coefficient models. Oxford: Clarendon, 1982.
(43) Dempster AP, Rubin DB, Tsutakawa RK. Estimation in covariance components
models. J Am Stat Assoc 1981;76:341–56.
(44) Searle SR, Casella G, McCullogh CE. Variance components. New York: Wiley,
1992.
(45) Morris C, Christiansen C. Fitting Weibull duration models with random
effects. Lifetime Data Anal1995;1:347–59.
(46) Zeger S, Liang K, Albert P. Models for longitudinal data: a generalized
estimating equation approach. Biometrics 1988;44:1049–60.
(47) Burton P, Gurrin L, Sly P. Extending the simple linear regression model
for correlated responses: an introduction to generalized estimating equations
and multi-level mixed modeling. Stat Med1998;17:1261–91.
Return to index
Epidemiological
Bulletin , Vol. 25 No. 1, march 2004