What is a General Linear Model?
The General Linear Model (GLM) is a useful framework for comparing how several variables affect different continuous variables. In its simplest form, the GLM is described as:
Data = Model + Error (Rutherford, 2001, p.3)
- In ANOVA, the “data” is the dependent variable scores, the “model” is the experimental conditions, and the “error” is the part of the data not explained by the model.
- In regression analysis, the independent predictors make up the “model” and the residuals are the “error” component.
- ANCOVA is a blend of ANOVA and regression and so can also be described as Data = Model + Error.
Identical Procedures
If you’re using software, the same matrix algebra equation is used for all three. They all fall under the umbrella of “GLM”, even if you find them in separate menus or procedures. If you’re in the (now unusual) situation of calculating ANOVA, ANCOVA or regression analysis by hand, time-saving computations exist for each one. This gives the illusion that they are separate entities, when in fact they are practically the same procedure.
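To make the "same procedure" claim concrete, here is a minimal sketch in Python that compares a two-group one-way ANOVA with a simple regression on a 0/1 dummy variable. The scores and the function names (anova_f, regression_f) are made up for illustration, and the code uses the hand-calculation shortcuts for each method rather than the matrix form that software uses.

```python
from statistics import mean

# Hypothetical scores for two experimental conditions (made-up data).
control = [4.0, 5.0, 6.0, 5.0]
treatment = [7.0, 8.0, 6.0, 9.0]


def anova_f(groups):
    """One-way ANOVA F statistic from between- and within-group sums of squares."""
    scores = [y for g in groups for y in g]
    grand = mean(scores)
    ss_between = sum(len(g) * (mean(g) - grand) ** 2 for g in groups)
    ss_within = sum((y - mean(g)) ** 2 for g in groups for y in g)
    df_between = len(groups) - 1
    df_within = len(scores) - len(groups)
    return (ss_between / df_between) / (ss_within / df_within)


def regression_f(x, y):
    """F statistic, intercept and slope for a least-squares regression of y on x."""
    mx, my = mean(x), mean(y)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    sxx = sum((xi - mx) ** 2 for xi in x)
    b1 = sxy / sxx
    b0 = my - b1 * mx
    fitted = [b0 + b1 * xi for xi in x]
    ss_model = sum((f - my) ** 2 for f in fitted)
    ss_error = sum((yi - f) ** 2 for yi, f in zip(y, fitted))
    # One predictor, so 1 model degree of freedom and n - 2 error degrees of freedom.
    f_stat = (ss_model / 1) / (ss_error / (len(y) - 2))
    return f_stat, b0, b1


# Dummy-code the condition: 0 = control, 1 = treatment.
x = [0.0] * len(control) + [1.0] * len(treatment)
y = control + treatment

f_anova = anova_f([control, treatment])
f_reg, b0, b1 = regression_f(x, y)

print("ANOVA F:     ", f_anova)
print("Regression F:", f_reg)
print("Intercept (control mean):", b0)
print("Slope (treatment mean minus control mean):", b1)
```

The two F statistics agree, the intercept is the control group mean, and the slope is the difference between the two group means. The two "separate" analyses are the same model fitted to the same data.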
With a single predictor, the GLM equation can be written as:
Y = β0 + β1X
where:
- Y = the dependent variable (also called the predicted, outcome, or response variable).
- β0 = the intercept — always a constant (i.e. the value never changes within the model).
- β1 = a weight or slope (also called a coefficient). Determines how much weight one variable contributes to the model. If everything else in the equation is held constant, β1 gives the predicted change in Y for a unit change in X.
- X = a variable.
If this looks similar to the regression equation, that’s because they are one and the same. However, the key word in general linear model is general; the procedure can handle a wide variety of variables, including non-numerical ones. During the procedure, the GLM converts a non-numerical variable to numbers before any calculations are made.
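One common way to turn a non-numerical variable into numbers is dummy (indicator) coding. The sketch below assumes that scheme; software packages may use other coding schemes, and the dummy_code helper and the diet data are hypothetical.

```python
def dummy_code(labels, reference):
    """Turn category labels into 0/1 indicator columns, omitting the reference level."""
    levels = sorted(set(labels) - {reference})
    rows = [[1 if label == level else 0 for level in levels] for label in labels]
    return levels, rows


# Hypothetical non-numerical predictor (made-up data).
diet = ["low", "medium", "high", "low", "high"]
levels, rows = dummy_code(diet, reference="low")

print("Indicator columns:", levels)
for label, row in zip(diet, rows):
    print(label, row)
```

Each non-reference category gets its own 0/1 column, and the reference category ("low" here) is the row of all zeros. The model then estimates one β per column, each measuring the difference from the reference category.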
When the variables in a GLM are standardized to a mean of zero and a standard deviation of 1 (i.e. they are converted to z-scores), the resulting βs (pronounced “betas”) are called beta weights. Otherwise, the coefficients are usually called Bs (as in the letter B in the English alphabet). The GLM equation with standardized βs is:
zY = β1zX
The intercept drops out because both standardized variables have a mean of zero.
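As a rough numerical check on how Bs and beta weights relate, the sketch below fits a one-predictor model twice: once on raw scores and once on z-scores. The data and helper names (slope_intercept, z_scores) are made up for illustration.

```python
from statistics import mean, pstdev


def slope_intercept(x, y):
    """Least-squares intercept and slope for a one-predictor linear model."""
    mx, my = mean(x), mean(y)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    sxx = sum((xi - mx) ** 2 for xi in x)
    b1 = sxy / sxx
    return my - b1 * mx, b1


def z_scores(values):
    """Rescale values to a mean of 0 and a standard deviation of 1."""
    m, s = mean(values), pstdev(values)
    return [(v - m) / s for v in values]


# Hypothetical raw scores (made-up data).
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.0, 4.0, 5.0, 4.0, 6.0]

b0, b1 = slope_intercept(x, y)                           # unstandardized Bs
beta0, beta1 = slope_intercept(z_scores(x), z_scores(y)) # beta weights

# The beta weight is the unstandardized slope rescaled by the two standard deviations.
rescaled = b1 * pstdev(x) / pstdev(y)

print("B0, B1:", b0, b1)
print("Standardized intercept (zero up to rounding):", beta0)
print("Beta weight:", beta1)
print("B1 * sd(X) / sd(Y):", rescaled)
```

With standardized variables the intercept is zero (up to floating-point rounding), and the beta weight matches the unstandardized slope multiplied by sd(X) / sd(Y).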
Emergence of the GLMM
Although many software packages still refer to certain procedures as “GLM”, the concept of a general linear model is seen by some as somewhat dated. It’s well recognized that the models can have non-linear components. There’s even some debate about the “general” part:
Calling it “general” seems quaint. It is certainly misleading ~ Stroup (2016).
Stroup prefers the term generalized linear mixed model (GLMM), of which the GLM is a subtype. GLMMs combine generalized linear models with mixed models, which allow random effects (GLMs allow only fixed effects). However, the GLMM is a new approach:
GLMMs are still part of the statistical frontier, and not all of the answers about how to use them are known (even by experts) ~ Bolker.
References
Bolker, B. (2017). Draft PDF posted on website: http://ms.mcmaster.ca/~bolker/classes/s4c03/notes/GLMM_Bolker_draft5.pdf
Rutherford, A. (2001). Introducing ANOVA and ANCOVA: A GLM Approach. SAGE.
Stroup, W. (2016). Generalized Linear Mixed Models: Modern Concepts, Methods and Applications. CRC Press.