Link function - Statistics How To

A link function in a generalized linear model (GLM) transforms the expected value of the response (dependent) variable so that it can be modeled as a linear function of the predictors. This lets you fit a linear model even when the relationship between the predictors and the mean response is non-linear. The link function connects the random and systematic (non-random) components of a GLM: the random component specifies a probability distribution for Y|X, while the systematic component relates a parameter η to the predictors (inputs) X through a linear combination. The link function “links” these components [1].
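The relationship between the two components can be sketched in a few lines of Python. This is a minimal illustration, not a fitted model: the coefficients in `beta` and the observation `x` are made-up values, and the logit link is chosen only as an example of a link g with g(μ) = η.

```python
import math

def linear_predictor(x, beta):
    """Systematic component: eta = x . beta (x carries a leading 1 for the intercept)."""
    return sum(xi * bi for xi, bi in zip(x, beta))

def logit(mu):
    """Link g: maps a mean in (0, 1) onto the whole real line."""
    return math.log(mu / (1.0 - mu))

def inv_logit(eta):
    """Inverse link g^-1: maps any real eta back to a mean in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-eta))

# Hypothetical values, chosen only for illustration.
beta = [-1.0, 0.5]   # intercept and slope
x = [1.0, 2.0]       # intercept term and one predictor value

eta = linear_predictor(x, beta)  # -1.0 + 0.5 * 2.0 = 0.0
mu = inv_logit(eta)              # expected value of the response: 0.5

print(eta, mu, logit(mu))
```

The model is linear on the scale of η, while the mean μ stays inside the range the response distribution allows. Applying the link to μ recovers η.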

Examples of Link Function

The link function for linear regression is the identity function, which maps every element in a set to itself. In other words, the linear predictor is itself the predicted mean of the outcome. Other regression models use different link functions, which transform the expected value of the response rather than the observed data.

The link function for a normal distribution is also the identity function. Common link functions include:

Link function	Data type
Logit link	Binary data
Log link	Count data
Log-log link	Data from extreme asymmetric distributions
Identity link	Continuous data
Complementary log-log (cloglog) link	Binary data
Inverse complementary log-log (icloglog) link	Binary data
Negative inverse link	Positive, continuous data
Probit link	Binary data
Power link	Continuous and count data
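Several of the links listed above apply to binary data, and it can help to see how they differ. The sketch below implements the logit, probit, and complementary log-log links with their inverses using only the Python standard library. The function names are illustrative and are not taken from any particular statistics package.

```python
import math
from statistics import NormalDist

_STD_NORMAL = NormalDist()  # standard normal: mean 0, standard deviation 1

def logit(p):
    return math.log(p / (1.0 - p))

def probit(p):
    # Inverse of the standard normal cumulative distribution function.
    return _STD_NORMAL.inv_cdf(p)

def cloglog(p):
    return math.log(-math.log(1.0 - p))

def inv_logit(eta):
    return 1.0 / (1.0 + math.exp(-eta))

def inv_probit(eta):
    return _STD_NORMAL.cdf(eta)

def inv_cloglog(eta):
    return 1.0 - math.exp(-math.exp(eta))

# Compare the three links at a few probabilities.
for p in (0.1, 0.5, 0.9):
    print(p, round(logit(p), 4), round(probit(p), 4), round(cloglog(p), 4))
```

The logit and probit links are symmetric around p = 0.5, where both equal 0. The complementary log-log link is not: at p = 0.5 it is about -0.37, which is why it is sometimes preferred when one outcome is much rarer than the other.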

Which link function you use depends on the distribution of the response variable and the desired relationship between the predictors and the response variable. The canonical link functions for several common distributions, where μ is the expected value of the response and Xβ is the linear predictor:

Distribution	Support	Link name	Link function
Bernoulli	integers: {0, 1}	Logit	Xβ = ln(μ / (1 – μ))
Binomial	integers: {0, 1, …, n}	Logit	Xβ = ln(μ / (n – μ))
Normal distribution	real: (-∞, ∞)	Identity	Xβ = μ
Exponential distribution	real: (0, ∞)	Negative inverse	Xβ = -μ^(-1)
Gamma distribution	real: (0, ∞)	Negative inverse	Xβ = -μ^(-1)
Poisson distribution	integers: 0, 1, 2, …	Log	Xβ = ln(μ)
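The formulas in this table can be checked numerically by pairing each link with its inverse. The following is a rough sketch under stated assumptions: the function names, the number of trials `n = 10`, and the Poisson coefficients `b0` and `b1` are hypothetical values picked for illustration.

```python
import math

def binomial_logit(mu, n):
    """Binomial link: eta = ln(mu / (n - mu)), for 0 < mu < n."""
    return math.log(mu / (n - mu))

def inv_binomial_logit(eta, n):
    """Inverse: mu = n / (1 + exp(-eta))."""
    return n / (1.0 + math.exp(-eta))

def negative_inverse(mu):
    """Exponential and gamma link: eta = -1 / mu, for mu > 0."""
    return -1.0 / mu

def inv_negative_inverse(eta):
    """Inverse: mu = -1 / eta. A positive mean requires eta < 0."""
    if eta >= 0:
        raise ValueError("eta must be negative to give a positive mean")
    return -1.0 / eta

def poisson_mean(x, b0, b1):
    """Log link: ln(mu) = b0 + b1 * x, so mu = exp(b0 + b1 * x)."""
    return math.exp(b0 + b1 * x)

n = 10                                   # hypothetical number of trials
eta_binom = binomial_logit(5.0, n)       # mean of 5 out of 10 gives eta = 0
eta_gamma = negative_inverse(2.0)        # mean of 2 gives eta = -0.5

# Under the log link, raising x by one unit multiplies the mean by exp(b1).
b0, b1 = 0.2, 0.3                        # hypothetical coefficients
ratio = poisson_mean(4.0, b0, b1) / poisson_mean(3.0, b0, b1)

print(eta_binom, eta_gamma, ratio, math.exp(b1))
```

One practical point the sketch makes visible: the negative inverse link only returns a valid (positive) mean when the linear predictor is negative, which is one reason the log link is often used for gamma models instead.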

References

[1] Generalized Linear Models: Advanced Methods for Data Analysis (36-402/36-608). Spring 2014. Retrieved April 19, 2021 from: https://www.stat.cmu.edu/~ryantibs/advmethods/notes/glm.pdf