Statistics How To

Phi Coefficient (Mean Square Contingency Coefficient)

Correlation Coefficients > Phi Coefficient

What is the Phi Coefficient?

The Phi Coefficient is a measure of association between two binary variables (i.e. living/dead, black/white, success/failure). It is also called the Yule phi or Mean Square Contingency Coefficient and is used for contingency tables when:

correspondence analysis

A simple contingency table. Image: Michigan Dept. of Agriculture



For a 2×2 contingency table where a, b, c, and d represent the observation frequencies (the cell count). The formula for phi is:
phi coefficient

Example: Find phi for the following contingency table:
phi 2


Solution: Insert the counts into the formula and solve.
Φ = ad – bc / √((a + b)(c + d)(a + c)(b + d))
Φ = 14*13 – 10*6 / √((14 + 10)(6 + 13)(14 + 6)(10 + 13))
Φ = 182 – 60 / √((24)(19)(20)(23))
Φ = 122/ √((24)(19)(20)(23))
Φ = 122/ 458
Φ = 0.266.

Interpreting the Phi Coefficient

The phi coefficient is a symmetrical statistic, which means the independent variable and dependent variables are interchangeable.
The interpretation for the phi coefficient is similar to the Pearson Correlation Coefficient. The range is from -1 to 1, where:

  • 0 is no relationship.
  • 1 is a perfect positive relationship: most of your data falls along the diagonal cells.
  • -1 is a perfect negative relationship: most of your data is not on the diagonal.

The Political Science Department at Quinnipiac University posted this useful list of the meaning of Pearson’s Correlation coefficients. The same rule of thumb can be used for the Phi coefficient. Note that these are “crude estimates” for interpreting strengths of relationships.

r value =
+.70 or higher Very strong positive relationship
+.40 to +.69 Strong positive relationship
+.30 to +.39 Moderate positive relationship
+.20 to +.29 weak positive relationship
+.01 to +.19 No or negligible relationship
0 No relationship
-.01 to -.19 No or negligible relationship
-.20 to -.29 weak negative relationship
-.30 to -.39 Moderate negative relationship
-.40 to -.69 Strong negative relationship
-.70 or higher Very strong negative relationship

References:
Quinnipiac University. Maaning of Pearson’s. Retrieved Jun 20, 2016 from: http://faculty.quinnipiac.edu/libarts/polsci/Statistics.html
Yule, G. U. (1912). J. R. Statist. Soc., 75,
576–642. (On the methods of measuring the
association between two variables. The first
identification of the phi-coefficient.)

------------------------------------------------------------------------------

If you prefer an online interactive environment to learn R and statistics, this free R Tutorial by Datacamp is a great way to get started. If you're are somewhat comfortable with R and are interested in going deeper into Statistics, try this Statistics with R track.

Comments are now closed for this post. Need help or want to post a correction? Please post a comment on our Facebook page and I'll do my best to help!
Phi Coefficient (Mean Square Contingency Coefficient) was last modified: November 14th, 2017 by Stephanie Glen