Statistics How To

Rank-Biserial Correlation

Statistics Definitions > Rank-Biserial Correlation

What is Rank-Biserial Correlation?

Rank-biserial correlation (rank biserial r or rrb) is used when you want to find a correlation between dichotomous (binary) nominal data and ordinal (ranked) data. It is a special case of Somers’ D.

Formula

The formula (assuming there are no tied ranks) is:
rrb = 2 * (Y1 – Y0) / n.
Where:

  • n = number of data pairs in the sample,
  • Y0 = Y score means for data pairs with x = 0,
  • Y1 = Y score means for data pairs with x = 1.

For example, let’s say you had the following data:
Dichotomous variable: 1,1,1,0,1
Ordinal variable: 3,1,5,4,2

  • Y0 = 4 (only one ordinal variable is paired with 0).
  • Y1 = 3+1+5+2/4 = 11/4 = 2.75
  • n = 5

Giving a rank-biserial correlation coefficient of: 2 * (2.75 – 4)/6 = -0.21.

Running the Test

  • In SAS: Run the %BISERIAL macro.
  • In SPSS: Click Analyze → Correlate → Bivariate. Add your variables, deselect Pearson (the default) and click Spearman.* Click OK.

*Glass(1966) noted that the rank biserial correlation is appropriate to estimate Spearman correlation, so I’m assuming this works both ways. You should note though, that it is an estimate.

Matched Pairs Rank Biserial

Kerby (2014) suggested the following for calculating matched pairs rank biserial correlation:

  1. Run a Wilcoxon signed ranks test.
  2. Add the sum of negative ranks and the sum of positive ranks (from the output). This equals the total sum of ranks.
  3. Divide the sum of negative ranks by the total sum of ranks to get a proportion.
  4. Divide the sum of positive ranks by the total sum of ranks to get a proportion.
  5. Find the difference between the two proportions. This is the matched pairs rank biserial.

References:
Glass, G. V. (1966). Note on rank biserial correlation. Educational and Psychological Measurement, 26, 623-631. Retrieved Jan 1, 2017 from here: http://journals.sagepub.com/doi/pdf/10.1177/001316446602600307
Kerby, D. S. (2014). The simple difference formula: an approach to teaching nonparametric correlation. Innovative Teaching, 3(1), Article-1.
Willson, V. L. (1976). Critical values of the rank-biserial correlation coefficient. Educational and Psychological Measurement, Vol. 36, pp. 297-300.

If you prefer an online interactive environment to learn R and statistics, this free R Tutorial by Datacamp is a great way to get started. If you're are somewhat comfortable with R and are interested in going deeper into Statistics, try this Statistics with R track.

Comments are now closed for this post. Need help or want to post a correction? Please post a comment on our Facebook page and I'll do my best to help!
Rank-Biserial Correlation was last modified: October 12th, 2017 by Andale